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SUBSTRATE SPECIFICITY OF PROTEIN KINASES 



Background of the Invention 

The activity of cells is regulated by external signals that stimulate or inhibit 
5 intracellular events. The process by which stimulatory or inhibitory signals are transmitted 
into and within a cell to elicit an intracellular response is referred to as signal transduction. 
Proper signal transduction is essential for proper cellular function. Defects in various 
components of signal transduction pathways, from cell surface receptors to activators of gene 
transcription, account for a vast number of diseases, including numerous forms of cancer, 

10 vascular diseases and neuronal diseases. 

Protein kinases are enzymes involved in signal transduction which phosphorylate 
other proteins and/or themselves (autophosphorylation). Protein kinases involved in signal 
transduction in eukaryotic cells can be divided into three major groups based upon their 
substrate utilization: protein-tyrosine specific kinases (which phosphorylate substrates on 

15 tyrosine residues), protein-serine/threonine specific kinases (which phosphorylate substrates 
on serine and/or threonine residues) and dual-specificity kinases (which phosphorylate 
substrates on tyrosine, serine and/or threonine residues). Well over a hundred protein kinases 
have been identified to date and more are being identified at a very fast rate (about 10 to 30 
new kinases per year) through genetic and molecular biological approaches. There are 

20 conserved regions among all protein kinases, suggesting evolution from a common ancestor. 
For reviews on protein kinases see Kemp, B.E. (ed.), (1990 ) Peptides and Protein 
Phosphorylation. CRC Press Inc. and Hanks et al. (1988) Science 241:42-52. 

In order to insure fidelity in intracellular signal transduction cascades it is essential 
that each protein kinase have exquisite specificity in downstream targets. In some cases a 

25 kinase may have only a single substrate in a cell, but in general kinases appear to have a 
collection of targets that allow branching of an initial signal delivered to a cell in multiple 
directions in order to coordinate a set of events that occur in parallel for a given cellular 
response (see Roach, P.J. (1991) J. Biol Chem. 266:14139-14142). The substrate specificity 
of a protein kinase can be influenced by at least three general mechanisms that depend on the 

30 overall structure of the enzyme: 1 ) specific domains in certain protein kinases target them to 
specific locations in the cell, thereby restricting their substrate availability; 2) other domains 
in the kinase, distinct from the catalytic domain, may provide high affinity association with 
either the substrate or an adapter molecule that presents the substrate to the kinase; and 3) 
specificity is ultimately provided by the structure of the catalytic site of the protein kinase. 

35 Although the number of protein kinases that have been implicated in intracellular 

signaling is quite large, in only a few cases is there information about the sequence specificity 
of these kinases. This information has usually come from locating the phosphorylation sites 
on in vivo and/or in vitro substrates of the kinase. (For examples see Taylor ^et al., (1990) 
Ann Rev. Biochem. 52:971-1005; Cheng, et al. (1991) J. Biol Chem. 266:17919-17925; 
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Walsh et al. (1990) in Peptides and Protein Phos phorylation, (B.E. Kemp, ed.) CRC Press 
Inc.pp. 43-84; Gaehlen and Harrison (1990) in Peptides and Protein Phosp horylation, (BJE. 
Kemp, ed.) CRC Press Inc.pp. 239-254). By comparing the sequences of several 
phosphorylation sites for a kinase a consensus sequence for substrates of that kinase can be 
5 determined. These types of studies have demonstrated the importance of the primary amino 
acid sequence around the site of phosphorylation in determining the in vivo specificity of 
protein kinases. Synthetic peptides can be constructed based upon the consensus sequence 
motif of a known phosphorylation site and individual amino acids can then be replaced one 
by one in order to determine the importance of particular amino acids on the K M or Vjj^ of 
10 the phosphorylation reaction. 

However, there are severe limitations to this approach for determining the substrate 
specificity of a protein kinase. The procedure is quite expensive and laborious since each 
amino acid residue within a phosphorylation site must be altered individually to evaluate its 
importance. This approach does not necessarily identify all the residues critical for substrate 
15 specificity and, furthermore, an optimal substrate sequence is not likely to be determined 
unless each residue is changed to every other possible amino acid residue individually and 
then evaluated. For example, based upon an estimation of 9 to 12 amino acid residues of a 
substrate contacting the active site cleft of a kinase there would be approximately 1 .024 x 
10 13 (20 10 ) distinct peptides to consider. Moreover, in many cases this approach is not 
20 feasible because in vivo substrates for some kinase cannot be determined with certainty. For 
example, if an extracellular signal activates multiple kinases which phosphorylate multiple 
substrates or if a signal activates a cascade of kinases it is difficult to determine which kinase 
phosphorylates which substrate. An even more difficult problem in determining the substrate 
specificity of certain kinases is that their critical in vivo substrates are often proteins which 
25 are present in vivo in very low abundance and thus are not easily detectable by in vivo 

phosphorylation assays. Even relatively abundant substrates can be overlooked because of 
their high rates of dephosphorylation by closely-associated phosphatases. 

Thus, there is a need for an alternative method to that of isolating and examining the 
sequences of native substrates for determining the substrate specificity of a protein kinase. 
30 Consensus sequence motifs for the phosphorylation sites of many known protein kinases 
have not yet been determined because of the limitations to current approaches discussed 
above. Information on the substrate specificity of each protein kinase involved in signal 
transduction would provide insight into signal transduction mechanisms and could allow for 
the design of novel therapeutic agents based on the substrate specificities of different protein 
35 kinases. 



Summary o f the Invention 

The invention provides a method for determining an amino acid sequence motif for a 
phosphorylation site of a protein kinase based upon selection of a subpopulation of peptides 



WO 95/18823 



-3- 



PCTAJS95/00147 



from a degenerate peptide library that are substrates for a protein kinase. In the method, 
peptides within an oriented degenerate peptide library that can be substrates for a protein 
kinase are phosphoryated by the kinase, converting them to phosphopeptides. The 
phosphopeptides are then separated from the remaining non-phosphorylated peptides in the 
5 library, thereby isolating the subpopulation of peptides that are substrates for the kinase. This 
subpopulation of peptides is then sequenced and the relative abundance of each amino acid 
residue at each degenerate position of the peptides is determined. An amino acid sequence 
motif for the phosphorylation site of a protein kinase can be determined from the most 
abundant amino acid residues at each degenerate position of the peptides within the library. 
10 The peptides of the oriented degenerate peptide library can have the formula: 

(XaaVZaa-CXaa)™ 

wherein Zaa is a non-degenerate phosphorylatable amino acid selected from the group 
1 5 consisting of Ser, Thr and Tyr, Xaa is any amino acid and n and m are integers from 1-10 
inclusive. Alternatively, certain amino acids can be omitted from the degenerate positions of 
the peptides of the library such that Zaa is the only phosphorylatable amino acid in the 
peptides. Accordingly, in another embodiment, when Zaa is Ser or Thr, Xaa is any amino 
acid except Ser or Thr. Likewise, in another embodiment, when Zaa is Tyr, Xaa is any amino 
20 acid except Tyr. Additionally, non-degenerate amino acid residues can be added to the N- 
terminal and/or C-terminal ends of the peptides of the oriented degenerate peptide library. 

In one embodiment of the method of the invention, the phosphorylated peptides are 
separated from the non-phosphorylated peptides by binding the phosphorylated peptides to a 
ferric column. Alternatively, the peptides in the library can be thio-phosphorylated and the 
thio-phosphorylated peptides can be separated from non-phosphorylated peptides by binding 
the thio-phosphorylated peptides to a mercury column. 

The method of the invention can be used to determine an amino acid sequence motif 
for a phosphorylation site of any protein kinase, including protein-serine/threonine specific 
kinases, protein-tyrosine specific kinases and dual-specificity kinases. Preferred families of 
kinases include cyclic nucleotide-dependent kinases, cell cycle control kinases, src family 
kinases, epidermal growth factor receptor-like kinases and abl family kinases. 

The amino acid sequence motif of a phosphorylation site of a kinase, determined by 
the method of the invention, can be used to make peptide substrates for the kinase. The 
peptide substrates comprise amino acid sequences corresponding to the amino acid sequence 
motif, which represents the preferred phosphorylation site for the kinase. Thus, these peptide 
substrates have a very high affinity for the kinase. 

The peptide substrates of the invention can be incorporated into a larger protein to 
create a preferred phosphorylation site for a kinase within the protein. 
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The amino acid sequence motif of a phosphorylation site of a kinase, determined by 
the method of the invention, can also be used to make pseudosubstrates for the kinase. The 
pseudosubstrates comprise peptides having amino acid sequences corresponding to the amino 
acid sequence motif but in which the phosphorylatable amino acid residue is replaced by a 
5 non-phosphorylatable amino acid residue. Peptide analogs can also be designed based upon 
the amino acid sequence motifs provided by the invention. 

The peptide substrates and pseudosubstrates of the invention can be used to inhibit the 
activity of a kinase. The phosphorylation of a substrate by a particular kinase can be 
inhibited by contacting the kinase with an inhibitory amount of a peptides substrate or 
10 pseudosubstrate for the kinase, as provided by the invention. 

The invention provides the amino acid sequence motifs for the phosphorylation sites 
of protein kinase A, cell cycle control kinases (including cyclin B/p33cdc2 and cyclin 
A/p33CDK2), src family kinases (including pp60c-src and pp60v-src), epidermal growth 
factor receptor (EGFR), p 92 c - f P s/ fes lck, c-abl, platelet derived growth factor receptor 
15 (PDGFR), fibroblast growth factor receptor (FGFR), insulin receptor, casein kinase II, the 
cell-cycle regulated protein kinase NIMA, phosphorylase kinase, Cam kinase II and Erk 1 . 
Peptide substrates and pseudosubstrates for these kinases, based upon the amino acid 
sequence motifs, are encompassed by the invention. 

20 Description of the Drawings 

Figure J is a graph depicting the elution profile of non-phosphorylated and 
phosphorylated peptides from a ferric column, demonstrating quantitative separation of non- 
phosphoiylated peptides from phosphorylated peptides. The presence of non-phosphorylated 
peptides was determined by absorbance at 280nm and the presence of phosphorylated 

25 peptides was determined by detection of an incorporated radiolabel. The phosphorylated 
peptides elute predominantly in fractions 11-14. 

Figure 2 is eight bar graphs depicting the relative abundance of each amino acid 
residue at each degenerate position of peptides from an oriented degenerate peptide library 
which are phosphorylated by protein kinase A. Each bar graph indicates the relative 

30 abundance of the IS amino acid residues at a given cycle of sequencing. Cycle 7, the site of 
phosphorylation of the peptide, is not shown. Therefore, boxes A, B, C and D indicate amino 
acid preferences at -4, -3 , -2 and -1 N-terminal of the phosphorylation site and boxes E, F, G 
and H indicate amino acid preferences at +1 , +2, +3, and +4 C-terminal to the 
phosphorylation site. The amino acid residues are listed on the X-axis and relative 

35 abundance is on the Y-axis. The bars represent average values from two independent 
experiments. 

Figure 3 is eight bar graphs depicting the relative abundance of each amino acid 
residue at each degenerate position of peptides phosphorylated by cyclin B/pSS 0 ^ 2 . The 
data is plotted as described for figure 2. 
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Figure 4 is eight bar graphs depicting the relative abundance of each amino acid 
residue at each degenerate position of peptides phosphorylated by cyclin A/p33CDK2 The 
data is plotted as described for figure 2. 

Figure 5 is eight bar graphs depicting the relative abundance of each amino acid 
5 residue at each degenerate position of peptides phosphorylated by products of middle 
T/ppeO 0 - 510 and pp60 v * src . The data is plotted as described for figure 2. The solid bars 
indicate the relative abundance of amino acids for products of pp60 v " src and the stippled bars 
indicate the relative abundance of amino acids for products of middle T/ppeO 0 " 51 ^. 

Figure 6 is eight bar graphs depicting the relative abundance of each amino acid 
10 residue at each degenerate position of peptides phosphorylated by the epidermal growth 
factor receptor. The data is plotted as described for figure 2. 

Figure 7 is eight bar graphs depicting the relative abundance of each amino acid 
residue at each degenerate position of peptides phosphorylated by the pp95 c -*P s /fe s . The 
data is plotted as described for figure 2. 
15 Figure 8A-B are graphs depicting the in vitro phosphorylation of synthetic src and abl 

optimal peptides by the abl protein tyrosine kinase (panel A) or the src protein tyrosine kinase 
(panel B). 

Figure 9 is a bar graph depicting the binding of the abl and Ick SU2 domains to 
phosphopeptides phosphorylated by either the abl protein tyrosine kinase or the lck protein 
20 tyrosine kinase. 

Figure 10 is a bar graph depicting the phosphorylation of optimal peptide substrates 
for src, abl or EGF receptor by either wild-type RET receptor protein tyrosine kinase or 
mutant (MEN 2B) RET receptor protein tyrosine kinase. 

25 Detailed Description of the Invention 

The following standard three-letter and one-letter abbreviations for amino acids are 
used throughout the application: 



alanine 


Ala 


A 


arginine 


Arg 


R 


aspartic acid 


Asp 


D 


asparagine 


Asn 


N 


cysteine 


Cys 


C 


glutamic acid 


Glu 


E 


glutamine 


Gin 


Q 


glycine 


Gly 


G 


histidine 


His 


H 


iso leucine 


He 


I 


leucine 


Leu 


L 


lysine 


Lys 


K 


methionine 


Met 


M 


phenylalanine 


Phe 


F 
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proline 


Pro 


P 


serine 


Ser 


S 


threonine 


Thr 


T 


tryptophan 


Trp 


W 


tyrosine 


Tyr 


Y 


valine 


Val 


V 



This invention pertains to the substrate specificity of protein kinases and to peptides 
which are substrates for protein kinases. The invention provides a method that allows for die 

1 0 identification of an amino acid sequence motif for the phosphorylation site of a specific 
protein kinase without having to identify, isolate and compare native substrates for the 
kinase. The method of the invention is based upon selection of a subpopulation of peptides 
from a degenerate peptide library that are substrates for a protein kinase. In the method, the 
peptides within a peptide library that can be substrates for a protein kinase are phosphoryated 

15 by the kinase, converting them to phosphopeptides. The phosphopeptides are then separated 
from the remaining non-phosphorylated peptides in the library, thereby isolating the 
subpopulation of peptides that are substrates for the kinase. This subpopulation of peptides is 
then sequenced and the relative abundance of each amino acid residue at each degenerate 
position of the peptides is determined. An amino acid sequence motif for the 

20 phosphorylation site of a protein kinase can be determined from the most abundant amino 
acid residues at each degenerate position of the peptides. This method has the advantage that 
it can be used to determine a phosphorylation site motif for any kinase, regardless of whether 
native substrates for that kinase have been identified. Furthermore, since the method 
involves selection of peptides which are phosphorylated most readily by a protein kinase, the 

25 determined amino acid sequence motif represents the optimal phosphorylation site for that 
kinase. 

Based upon the amino acid sequence motif of the phosphorylation site of a kinase 
determined by the method of the invention, peptides can be made which are substrates for 
that particular protein kinase. Peptides comprising the most preferred amino acid residues of 

30 a sequence motif represent optimal substrates for the protein kinase. The peptide substrates 
of the invention can be used to detect and quantitate protein kinases. Furthermore, due to 
their high affinity for the catalytic site of a protein kinase, the peptide substrates of the 
invention can be used to competitively inhibit the activity of a protein kinase. 
Pseudosubstrates and peptide analogs can also be designed based upon the amino acid 

35 sequences of the peptide substrates of the invention. 

The invention provides a method for determining an amino acid sequence motif for a 
phosphorylation site of a protein kinase. The phrase "an amino acid sequence motif for a 
phosphorylation site of a protein kinase" is intended to describe one or more amino acid 
sequences which represent a consensus sequence motif for the region including and 

40 surrounding an amino acid residue which is phosphorylated by a protein kinase. The method 
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for determining an amino acid sequence motif for the phosphorylation site of a protein kinase 
first involves contacting a protein kinase to be examined with an oriented degenerate peptide 
library composed of non-phosphorylated peptides having a phosphorylatable amino acid 
residue at a fixed non-degenerate position. For a given kinase, only a small subset of the 
5 peptides have amino acids surrounding the phosphorylatable residue that create a preferred 
sequence for binding to the kinase and phosphorylation by the kinase. The protein kinase is 
allowed to phosphoiylate the subset of peptides that are preferred substrates for the kinase, 
thereby converting this population of peptides to a population of phosphorylated peptides. 
Next, the population of phosphorylated peptides is separated from the remaining non- 
10 phosphorylated peptides. Finally, the mixture of phosphorylated peptides is subjected to 

sequencing (e.g., automated sequencing) and the abundance of each amino acid determined at 
each cycle of sequencing is compared to the abundance of each amino acid at the same cycle 
in the starting peptide library. Since the phosphorylated residue is at the same position in 
every peptide of the library (e.g., residue 7 from the N-terminus), the most abundant amino 
1 5 acid(s) at a particular cycle indicate the amino acid(s) preferred by the kinase at that position 
relative to the site of phosphorylation. 

The term "protein kinase" as used herein is intended to include all enzymes which 
phosphorylate an amino acid residue within a protein or peptide. Preferred protein kinases 
for use in the method of the invention include protein-serine/threonine specific protein 
20 kinases, protein-tyrosine specific kinases and dual-specificity kinase. Other protein kinases 
which can be used in the method of the invention include protein-cysteine specific kinases, 
protein-histidine specific kinases, protein-lysine specific kinases, protein-aspartic acid 
specific kinases and protein-glutamic acid specific kinases. A protein kinase used in the 
method of the invention can be a purified native protein kinase, for example purified from a 
25 biological source. Some purified protein kinases are commercially available (e.g., protein 
kinase A from Sigma Chemical Co.). Alternatively, a protein kinase used in the method of 
the invention can be a recombinantly produced protein kinase. Many protein kinases have 
been molecularly cloned and characterized and thus can be expressed recombinantly by 
standard techniques. A recombinantly produced protein kinase which maintains proper 
kinase function can be used in the method of the invention. If the recombinant protein kinase 
to be examined is a eukaryotic protein kinase, it is preferable that the protein kinase be 
recombinantly expressed in a eukaryotic expression system to ensure proper post- 
translational modification of the protein kinase. Many eukaryotic expression systems (e.g., 
baculovirus and yeast expression systems) are known in the art and standard procedures can 
be used to express a protein kinase recombinantly. A recombinantly produced protein kinase 
can also be a fusion protein (i.e., composed of the protein kinase and a second protein or 
peptide, for example a protein kinase fused to glutathione-S-transferase (GST)) as long as the 
fusion protein retains the catalytic activity of the non-fused form of the protein kinase. 
Furthermore, the term "protein kinase" is intended to include portions of native protein 
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kinases which retain catalytic activity. For example, asubunit of a multisubunit kinase 
which contains the catalytic domain of the protein kinase can be used in the method of the 
invention. 

The term "degenerate peptide library" is intended to describe populations of peptides 
in which different amino acid residues are present at the same position in different peptides 
within the library. For example, a population of peptides of 10 amino acids in length in 
which the amino acid residue at position 5 of the peptides can be any one of the twenty amino 
acids would be a degenerate peptide library. A position within the peptides which is 
occupied by different amino acids in different peptides is referred to herein as a "degenerate 
position"; a position within the peptides which is occupied by the same amino acid in 
different peptides is referred to herein as a "non-degenerate position". The "oriented 
degenerate peptide library" used in the method of the invention is composed of non- 
phosphorylated peptides which have a phosphorylatable amino acid residue at a fixed, non- 
degenerate position. This means that the peptides contained within the library all have the 
same phosphorylatable amino acid residue at the same position within the peptides. The term 
"phosphorylatable amino acid residue" is intended to include those amino acid residues which 
can be phosphorylated by a protein kinase. Phosphorylatable amino acid residues include 
serine, threonine and tyrosine, or phosphorylatable analogs thereof. 

In addition to the phosphorylatable non-degenerate amino acid residue, the peptides of 
the oriented degenerate peptide library have other amino acid residues which are degenerate. 
Preferably, the amino acid residues on either side of the phosphorylatable amino acid residue 
are degenerate (i.e, immediately N-terminal and C-terminal to the phosphorylated residue), 
thus enabling one to determine a phosphorylation site motif for the region surrounding the 
phosphorylated residue. For example, four amino acid residues located on each side of the 
phosphorylatable amino acid residue can be degenerate (e.g., at positions -4, -3, -2, -1, +1, 
+2, +3 and +4 relative to a phosphorylated amino acid residue at position 0 can be 
degenerate). The degenerate positions in the peptides of an oriented degenerate peptide 
library can be created such that any one of the twenty amino acids can occupy those 
positions. However, in order to reduce "background" phosphorylation events (i.e, 
phosphorylation events at a residue other than the fixed phosphorylatable residue) it is 
preferred that the degenerate positions not contain amino acid residues that can be 
phosphorylated by the particular protein Idnase being examined. Thus, for protein-' 
serine/threonine kinases it is preferred that the degenerate positions not contain serine or 
threonine. Likewise, for a protein-tyrosine specific kinase, it is preferred that the degenerate 
positions not contain tyrosine and for a dual-specificity kinase is it preferred that the 
degenerate positions not contain serine, threonine or tyrosine. 

However, an oriented degenerate peptide library which contains additional 
phosphorylatable residues to the fixed residue can be used and it is possible to estimate the 
degree of background which will occur when using such a library and take this background 
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into consideration when evaluating the results of the library screening. For example, consider 
the problem of including Tyr at the degenerate positions of a library having 8 degenerate 
positions that is to be used with a protein-Tyr kinase. The kinase may phosphorylate the 
tyrosine residues at the degenerate positions as well as the Tyr at the fixed position. The 
5 theoretical fraction of peptides with Tyr at one of the degenerate sites in addition to the fixed 
site (assuming a degeneracy of 20 at each residue) is 8 x (l/20)=0.4. Thus, about 40% of the 
phosphopeptides purified are likely to be phosphorylated at the wrong residue. It is possible 
to get a good estimate of the extent of this problem since those peptides that are 
phosphorylated at sites other than the fixed residue will have detectable non-phosphorylated 

10 Tyr at the sequencing cycle corresponding to the fixed residue when the sequencing is 

performed on the mixture. Although these phosphopeptides will cause a background problem 
when the data are plotted, the problem is less serious than initially implicated from the 40% 
contamination assumed. For example, consider the case in which the protein-tyrosine kinase 
being evaluated phosphorylates the sequence Glu-Xxx-Tyr and the fixed Tyr is at position 7 

15 in the peptides. Those peptides with Glu at position 5 will be preferentially phosphorylated. 
However, peptides with Glu at position 3 and Tyr at position 5 will also be phosphorylated 
and appear in the mixture. Similarly, those with Glu at 4 and Tyr at 6, Glu at 6 and Tyr at 8, 
Glu at 8 and Tyr at 10 and Glu at 9 and Tyr at 1 1 will also be selected. Each of these 
subfamilies of peptides is far less abundant than the group with Tyr fixed at position 7 (in 

20 theory, l/20th). Thus, the expected result is that Glu will be very abundant at cycle 5 but will 
also be somewhat elevated (l/20th as much as in cycle 5) at cycles 3, 4, 6, 8, and 9. In 
general this background is unlikely to be a problem. The importance of re-evaluating protein 
kinases with phosphorylatable residues at the degenerate positions is that Ser, Thr or Tyr 
residues upstream or downstream of the phosphorylated residue are already known to be 

25 important for some protein kinases. 

Additionally, certain amino acid residues may be omitted from a degenerate position 
in a degenerate peptide library for practical reasons. For example, tryptophan and cysteine 
residues may be omitted from the degenerate positions because there can be problems with 
detecting these residues during amino acid sequencing and cysteine residues may cause 

30 peptide dimer formation. However, a peptide library containing Trp and Cys at a degenerate 
position can be used in the method of the invention. For example, a "second generation" 
library can be made based upon the amino acid sequence motif determined from initially 
screening a protein kinase with a peptide library which does not contain Trp or Cys at 
degenerate positions. At each degenerate position of this library, the peptides would have 

35 either the preferred amino acid residue determined from the initial screening or Cys or Trp. 
Thus, in this library, rather than being 1 of 20 possible residues at the degenerate position, 
Cys and Trp would be 1 of 3 possible residues at the degenerate position. Therefore, Trp and 
Cys will have a much stronger signal during amino acid sequencing of the peptides, thereby 
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allowing for detection of these residues and for evaluation of their influence on the amino 
acid sequence motif of the phosphorylation site. 

A preferred type of oriented degenerate peptide library for use in the method of the 
invention is a soluble synthetic peptide library. The term "soluble synthetic peptide library" 
5 is intended to mean a population of peptides which are constructed by in vitro chemical 
synthesis, for example using an automated peptide synthesizer, and which are not connected 
to a solid support such as a bead or a cell. For general descriptions of the construction of 
soluble synthetic peptide libraries see for example Houghten, R.A., et ah, (1991) Nature 
254:84-86 and Houghten, R.A., et ai., (1992) BioTechniques 12:412-421. Standard 
10 techniques for in vitro chemical synthesis of peptides are known in the art For example, 
peptides can be synthesized by (benzotriazolyloxy)tris(dimethylamino)-phosphonium 
hexafluorophosophate (BOP)/l-hydroxybenzotriazole coupling protocols. Automated 
peptide synthesizers are commercially available (e.g., Milligen/Biosearch 9600). To create 
degenerate positions within peptides of a soluble synthetic peptide library, two approaches 

15 can be used. A preferred approach is to divide the resin upon which the peptides are 

synthesized into equivalent portions and then couple each aliquot to a different amino acid 
residue to create a degenerate position. After this coupling, the resin aliquots are recombined 
and the procedure is repeated for each degenerate position. This approach results in 
approximately equivalent representation of each different amino acid residue at the 

20 degenerate position. Alternatively, a mixture of different amino acid residues can be added to 
a coupling step to create a degenerate position. However, different amino acid residues have 
different coupling efficiencies and therefore if equal amounts of each amino acid are used, 
each amino acid residue may not be equivalently represented at the degenerate position. The 
different coupling efficiencies of different amino acids can be compensated for by using a 

25 "weighted" mixture of amino acids at a coupling step, wherein amino acids with lower 

coupling efficiencies are present in greater abundance than amino acids with higher coupling 
efficiencies. A procedure for constructing a soluble synthetic degenerate peptide library 
which can be used in the method of the invention is described in detail in Example 1 . 

An alternative type of oriented degenerate peptide library which can be used in the 

30 method of the invention is a solid-support bound peptide library. The term "solid-support 
bound peptide library" is intended to mean a population of peptides which are connected to a 
solid support such as a bead or plastic pin. For general descriptions of the construction of 
solid-support bound peptide libraries see for example Geysen, H.M., et al. (1986) Mol 
Immunol 22:709-715; Lam, K.S., et al. (1991) Nature 254:82-84; and Pinilla, C, et al. 

35 (1992) BioTechniques 12:901-905. For this type of library, the peptides are synthesized 

attached to the solid support, such as a bead, and degenerate positions are created by splitting 
the population of beads, coupling different amino acids to different subpopulations and 
recombining the beads. The final product is a population of beads each carrying many copies 
of a single unique peptide. Thus, this approach has been termed "one bead/one peptide". 
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Another alternative type of oriented degenerate peptide library which can be used in 
the method of the invention is a phage expression library. The term "phage expression 
library" is intended to mean a population of filamentous bacteriophage particles which 
express a library of peptides on their surface wherein each phage particle expresses a different 
5 peptide. For general descriptions of the construction of phage expression libraries see for 
example Scott, J.K and Smith, G.P. (1990) Science 242:386-390; Cwirla, S.E., et al. (1990) 
Proc. Natl Acad. ScL USA 52:6378-6382; and Felicia, F., et al. (1991) J. Mol Biol 222:301- 
310. A phage expression library is based upon the expression by the phage of peptides 
encoded by nucleic acid introduced into the phage. Degenerate positions in the peptides arc 

10 created by inserting degenerate nucleic acid sequences, i.e., nucleic acids which have 

different nucleotides at the same position in different nucleic acid molecules. Degenerate 
nucleic acid sequences can be made by standard techniques known in the art. When using a 
phage expression library, the amino acid sequences of peptides selected during a screening 
process are determined by sequencing the inserted nucleic acid and thereby deducing the 

IS amino acid sequence of the encoded peptides. 

A soluble synthetic peptide library is preferred for use in the method of the invention 
because the bulk population of isolated phosphorylated peptides can be sequenced 
simultaneously, thus directly providing information on the relative abundance of different 
amino acid residues at each degenerate position within the population of peptides. In 

20 contrast, if a solid-support bound library or a phage expression library is used, each isolate 
gives only the single amino acid sequence of that isolated peptide and therefore many isolated 
peptides must be individually sequenced before one can arrive at a consensus motif. Even 
then, it is not possible to predict the order of affinities of the various peptides isolated unless 
they are individually synthesized and compared in binding experiments. One cannot be 

25 certain that one has not missed the best possible motif because not enough isolated peptides 
were sequenced. In addition, while the soluble synthetic oriented peptide library provides 
predictions about substitutions that would severely reduce the affinity of the peptide, no such 
information can be obtained from a solid-support bound library or a phage expression library. 
It has been estimated that approximately 9 to 12 amino acids of a substrate peptide are 

30 likely to contact the active site cleft of a kinase, based upon mutagenesis studies of known 
consensus motifs for phosphorylation sites of certain protein kinases and from X-ray 
crystallographic studies of peptides bound to protein kinases. Accordingly, an oriented 
degenerate peptide library can be used in which the degenerate residues encompass the region 
likely to contact the active site cleft of the kinase. The number of amino acids residues 

35 surrounding the phosphorylated residue of a substrate that influence the substrate specificity 
of a kinase is likely to differ for different kinases (again based upon mutagenesis studies of 
known consensus motifs for certain kinases). Thus, it is expected that oriented degenerate 
peptide libraries which will be useful in the method of the invention may have as few as one 
degenerate amino acid residue on either side of the phosphorylated residue or as many as ten 
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degenerate amino acid residues on either side of the phosphorylated residue. A preferred 
number of degenerate residues on either side of the phosphorylation site is four 
(corresponding to positions -1, -2, -3 -4, +1, +2, +3 and +4 relative to the phosphorylated 
residue). Alternative to having an equal number of degenerate residues on either side of the 
5 phosphorylated residue, one can use a library which has unequal numbers of degenerate 
residues on either side of the phosphorylated residue (e.g., 2 on one side and 4 on the other, 
etc.). The diversity of the peptide library (i.e., the number of different peptides contained 
within the library) is a function of the number of degenerate residues: the greater the number 
of degenerate residues the greater the diversity. For example, a library in which only 2 
10 positions are degenerate and any amino acid can be at these degenerate positions would 

represent 400 unique peptides (20 2 ) whereas a library in which 8 positions are degenerate and 
any amino acid can be at these positions would represent approximately 2.5 x 10 10 unique 
peptides (20*). 

In one embodiment of the invention, the oriented degenerate peptide library is 
1 5 composed of peptides having a formula: 

(Xaa)n-Zaa-(Xaa) m (SEQ ID NO: 1 ) 

wherein Zaa is a non-degenerate phosphorylatable amino acid selected from Ser, Thr and Tyr, 

20 Xaa is any amino acid and n and m are integers from 1-10 inclusive. Thus, (Xaa) n and 

(Xaa^ can be degenerate residues and there are between 1 and 1 0 residues (not all of which 
are required to be degenerate) on either side of the non-degenerate phosphorylatable residue. 

In a preferred embodiment of the invention, the phosphorylatable amino acid residue 
at the fixed non-degenerate position is the only phosphorylatable amino acid residue in the 

25 non-phosphorylated peptides comprising the peptide library. Peptide libraries can be 
constructed in which the type of amino acid residue(s) which a kinase phosphorylates is 
omitted at the degenerate position(s). Accordingly the invention provides oriented 
degenerate peptide libraries composed of peptides having a formula: (Xaa) n -Zaa-(Xaa) m 
that have certain amino acids omitted from the degenerate positions). In the case where the 

30 protein kinase is a protein-serine/threonine specific kinase, the library comprises peptides 
wherein Zaa is a non-degenerate phosphorylatable amino acid selected from Ser and Thr and 
Xaa is any amino acid except Ser and Thr. In the case where the protein kinase is a protein- 
tyrosine specific kinase, the library comprises peptides wherein Zaa is Tyr and Xaa is any 
amino acid except Tyr. In the case where the protein kinase is a dual-specificity kinase, a 

35 protein-serine/threonine specific kinase or a protein-tyrosine specific kinase, the library 
comprises peptides wherein Zaa is a non-degenerate phosphorylatable amino acid selected 
from Ser, Thr and Tyr, and Xaa is any amino acid except Ser, Thr and Tyr. 

A peptide library comprising peptides comprising the formula (Xaa) n -Zaa-(Xaa) m , 
(wherein Zaa is a non-degenerate phosphorylatable amino acid, Xaa is any amino acid and n 
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and m are integers from 1-10 inclusive), can be contained within a larger peptide or within a 
protein. For example, when a phage expression library is used, the peptide encompassing the 
phosphorylatable amino acid residue and the surrounding degenerate amino acid residues is 
contained within a phage protein expressed on the surface of the phage (i.e., the degenerate 

5 peptides are part of a fusion protein comprising the phage particle protein and the peptide 
inserts of the library). Additionally, for soluble synthetic peptide libraries, it may be 
desirable to add additional amino acid residues to one or both ends of the degenerate region 
of the peptides. Non-degenerate amino acids residues N-terminal to the degenerate region of 
the peptides can serve to verify that peptides from the peptide mixture are being sequenced 

10 properly and can be used to quantitate the amount of peptides present. Similarly, non- 
degenerate amino acid residues C-tenninal to the degenerate region of the peptides can be 
used for quantification purposes. The addition of a poly-lysine tail to the C-terminal end of 
the peptides of the library can prevent wash-out of peptides during automated sequencing and 
improve the solubility of the peptide mixture. 

15 Another embodiment of the peptide library of the invention specifically allows for the 

addition of non-degenerate amino acids at the N-terminal and/or C-terminal ends of the 
degenerate region of the peptides. In this embodiment, the oriented degenerate peptide 
library comprises non-phosphorylated peptides comprising a formula: 



20 Yj -(XaaVZaa-(Xaa)m-Y2 (SEQ ID NO: 2) 

wherein 

Zaa is a non-degenerate phosphorylatable amino acid selected from the 
group consisting of Ser, Thr and Tyr, 
25 Xaa is any amino acid, 

n and m are integers from 1-10 inclusive, 

Y j is hydrogen or a peptide having a formula (Xaa) a wherein Xaa is any 

non-degenerate amino acid and a is an integer from 1-15 inclusive, and 
Y2 is hydrogen or a peptide having a formula (Xaa)b wherein Xaa is any 
30 non-degenerate amino acid and b is an integer from 1-15 inclusive. 



This library encompasses peptides up to a length of 51 amino acids. Alternatively, a and b 
can be integers from 1-10 inclusive. This library encompasses peptides up to a length of 41 
amino acids. Alternatively, a and b can be integers from 1-5 inclusive. This library 
35 encompasses peptides up to a length of 3 1 amino acids. 

In yet another embodiment, the invention provides an oriented degenerate peptide 
library comprising non-phosphorylated peptides comprising a formula 
<SEQIDNO:3): 
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Met-Ala-Xaal-Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-AIa-Lys-Lys-Lys 

wherein Xaal, Xaa2, Xaa3, Xaa4, Xaa6, Xaa7, Xaa« and Xaa9 are any amino acid residue 
and Xaa5 is selected from Ser, Thr and Tyr. 
5 An oriented degenerate peptide library provided by the invention is contacted with a 

protein kinase under conditions which allow for phosphorylation of a substrate by the protein 
kinase and the kinase is allowed to phosphorylate peptides within the oriented degenerate 
peptide library having a preferred sequence for phosphorylation by the protein kinase, thereby 
forming a population of phosphorylated peptides. The phrase "contacted under conditions 

1 0 which allow for phosphorylation of a substrate by the protein kinase" is intended to include 
any form of combining or incubating together of the kinase and the library under conditions 
which enable phosphorylation of substrate proteins or peptides by the kinase. Thus, these 
conditions will include the presence of ATP (or an ATP analogue) as a phosphate donor 
molecule (or analogue thereof). The phosphate of the phosphate donor molecule can be 

15 labeled, e.g., radiolabeled, to label peptides which become phosphorylated by the kinase (e.g., 
to allow for their detection by detecting the radiolabel). For example, 32 P-y-ATP can be used 
as the phosphate donor. Preferred conditions for contacting the kinase with the library are 
described in detail in Example 2. 

Following phosphorylation of phosphorylatable peptides within the oriented 

20 degenerate peptide library, the phosphorylated peptides are separated from the remaining 
non-phosphorylated peptides of the library. A preferred method for separating 
phosphorylated peptides from non-phosphorylated peptides is by binding the phosphorylated 
peptides to a ferric column. This type of column has been used previously to separate tryptic 
phosphopeptide fragments of phosphorylated proteins from non-phosphorylated tryptic 

25 fragments (Muszynska et al., (1986) Biochem. 25: 6850-6853; Muszynska et al, (1992) J. 
Chromatography 19-28). However, previously described protocols could not be used 
because the phosphopeptide were eluted from the column in such a way that they could not 
then be sequenced, which is the necessary subsequent step in the procedure (i.e., the washing 
and elution buffers were incompatible with subsequent sequencing of the phosphopeptides). 

30 The column washing and elution conditions were modified to allow for subsequent 

sequencing of the eluted phosphopeptides. In the modified procedure, the peptide mixture is 
loaded onto the column in high salt buffer of about pH 5.5-6.0 (e.g., 50 mM MES, 1 M NaCl, 
pH 5.5). In this buffer, phosphorylated peptides bind to the column whereas non- 
phosphorylated peptides flow through the column. Next, the column is washed with a very 

35 low salt buffer of about pH 6.0 (e.g., 2 mM MES, pH 6.0) to remove contaminating non- 
phosphorylated peptides and excess salt from the column. Finally, the phosphopeptides are 
eluted from the column with a buffer of about pH 8.0 (e.g., 500 mM NH 4 C0 3 , pH 8.0), A 
procedure for quantitative separation of phosphorylated peptides from non-phosphorylated 
peptides using a ferric column is described in detail in Example 3. 
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Alternatively, since many protein kinases are known to be capable of using ATP-y-S 
as a thio-phosphate donor to phosphorylate proteins and peptides, a mercury (Hg2+) column 
(pChloromercuribenzoate-Agarose: Pierce) could be used for separating thio-phosphorylated 
peptides from non-phosphorylated peptides. Such columns have been used in the past to bind 
5 thio-phosphorylated nucleotides and peptides (Sun, I.Y-C, and Allfrey, V.G. (1982) J. Biol 
Chem. 252:1347-1353 and Sun et ah, (1980) J. Biol Chem. 255:742-747). Since any thiol 
group will bind to the mercury column, Cys cannot be present in the library if the mercury 
column is used. Also, the kinase reaction is typically about 5 fold slower with ATP-y-S so 
more enzyme or a longer incubation time is likely to be necessary. 

10 The use of a mercury column to separate thio-phosphorylated peptides from non-thio- 

phosphorylated peptides can allow for the inclusion of phosphotyrosine, phosphoserine or 
phosphothreonine at degenerate positions in the library. In this case, many peptides within 
the library would be phosphorylated during the synthesis of the library and the kinase 
reaction would then be performed with ATP-y-S as a thio-phosphate donor to 

15 thiophosphorylate substrates of the kinase. The thiophosphorylated peptides would then be 
separated from the non-thio-phosphorylated peptides with a mercury column. 

Another theoretical alternative approach to separating phosphorylated peptides from 
non-phosphorylated peptides is to use an antibody-affinity column. For example, an anti- 
phosphotyrosine antibody-column could be used to separate peptides phosphorylated on 

20 tyrosine from non-phosphorylated peptides. However, because anti-phosphotyrosine 

antibodies have some specificity for the amino acid residues surrounding the phosphorylated 
tyrosine, the antibody column would impose additional selection on the mixture of 
phosphorylated peptides. Thus, it is likely that some peptides which can be substrates for the 
kinase (i.e., a proportion of the phosphorylated peptide mixture) will be lost during the 

25 column separation step, thereby artifactually altering the mixture of peptides which is 
sequenced. 

Prior to the step of separating the phosphorylated (or thio-phosphorylated) peptides 
from non-phosphorylated peptides, additional purification steps may be added to remove 
other components of the kinase reaction, such as the kinase itself and the free phosphate 

30 donor. For example, the kinase can be bound to a solid support, such as a bead, during the 
kinase reaction and then the supernatant, containing the phosphorylated and non- 
phosphorylated peptides can be removed, thereby separating the kinase from the peptide. 
Chromatography can be used to separate the free phosphate donor (e. g, ATP) from the 
peptides, for example a DEAE column can be used. 

35 After the phosphorylated peptides are separated from non-phosphorylated peptides, 

the amino acid sequences of the phosphorylated peptides are determined. Preferably, the 
oriented degenerate peptide library used is a soluble synthetic peptide library and the 
phosphorylated peptides are sequenced as a bulk population using an automated peptide 
sequencer (alternatively, manual sequencing could be performed). This approach provides 
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information on the abundance of each amino acid residue at a given cycle in the sequence ©f 
the phosphopeptide mixture, most importantly at the degenerate positions. For m rh 
degenerate position in the peptides of the library, a relative abundance value can then be 
calculated by dividing the abundance of a particular amino acid residue at that position after 
5 library screening (i.e., after peptide phosphorylation and separation) by the abundance of the 
same amino acid residue at that position in the starting library. Thus, the relative abundance 
(RA) of an amino acid residue Xaa at a degenerate position in a peptide can be defined as: 

R A » amount of Xaa in the population of phosp horvlated pep tiffep 
10 amount of Xaa in the oriented degenerate peptide library 

The relative abundance value may be corrected for background contamination as described in 
Example 4. Amino acid residues which are neither enriched for nor selected against in the 
population of peptides which can serve as substrates for the kinase (i.e., the phosphoiylated 

15 peptides) will have a relative abundance of 1 .0. Those amino acid residues which are 

preferred at a particular degenerate position (i.e., residues which are enriched at that position 
in the phosphorylated peptides) will have a relative abundance greater than 1 .0. Those amino 
acid residues which are not preferred (i.e., residues which are selected against at that position 
in the phosphorylated peptides) will have a relative abundance less than 1.0. Based upon the 

20 relative abundance values for each amino acid residue at a degenerate position, preferred 
amino acid residues, i.e., amino acid residues with a relative abundance greater than 1 .0, can 
be identified at that position. 

As discussed above, if a solid-support bound library or a phage expression library is 
used, each isolate must be sequenced individually (each individual isolated "bead" for the 

25 solid support bound library or each individual nucleic acid insert for each phage particle, 
wherein the peptide sequence is deduced from the amino acid sequence). This requires 
sequencing of large numbers of isolates and would likely provide only an estimation of the 
relative abundance of each amino acid residue at a degenerate position. 

Based upon the relative abundance of different amino acid residues at each degenerate 

30 position within the population of phosphorylated peptides, an amino acid sequence motif for 
a phosphorylation site of the protein kinase can be determined. The amino acid sequence 
motif encompasses the degenerate region of the peptides. The particular amino acid residues 
chosen for the motif at each degenerate position are those which are most abundant at each 
position. Thus, an amino acid residue(s) with a relative abundance value greater than 1.0 at a 

35 particular position can be chosen as the amino acid residue(s) at that position within the 

amino acid sequence motif. Alternatively, a higher relative abundance value can be used as 
the basis for inclusion of an amino acid residue to create an even more preferred 
phosphorylation site for a kinase. For example, an amino acid residue(s) with a relative 
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abundance value equal to or greater than 1 .5 at a particular position can be chosen as the 
amino acid residue(s) at that position within the amino acid sequence motif. 

The analysis of data obtained from sequenced phosphorylated peptides is described in 
more detail in Example 4 and determinations of amino acid sequence motifs for seventeen 

5 known protein kinases are described in detail in Examples 5-9 and 11-12. 

The method of the invention can be used to determine an amino acid sequence motif 
for the phosphorylation site of any kinase. For example, the kinase can be a protein- 
serine/threonine specific kinase (in which case a library with a fixed non-degenerate serine or 
threonine is used), a protein-tyrosine specific kinase (in which case a library with a fixed non- 

0 degenerate tyrosine is used) or a dual-specificity kinase (in which case a library with either a 
fixed non-degenerate serine, threonine or tyrosine can be used). Examples of protein kinases 
which are encompassed by the invention can be found in Hanks et al. (1988) Science 241 :42- 
52. 

Protein-serine/threonine specific kinases encompassed by the invention include: 1) 

5 cyclic nucleotide-dependent kinases, such as cyclic-AMP-dependent protein kinases (e.g.; 
protein kinase A) and cyclic-GMP-dependent protein kinases; 2) calcium-phospholipid- 
dependent kinases, such as protein kinase C; 3) calcium-calmodulin-dependent kinases, 
including CaMII, phosphorylase kinase (PhK), myosin light chain kinases (e.g., MLCK-K, 
MLCK-M), PSK-H1 and PSK-C3; 4) the SNF1 family of protein kinases (e.g., SNF 1, niml, 

0 KIN1 and KIN2); 5) casein kinases (e.g., CKII); 6) the Raf-Mos proto-oncogene family of 
kinases, including Raf, A-Raf, PKS and Mos; and 7) the STE7 family of kinases (e.g. STE7 
and PBS2). Additionally, the protein-serine/threonine specific kinase can be a kinase 
involved in cell cycle control. Many kinases involved in cell cycle control have been 
identified. Cell cycle control kinases include the cyclin dependent kinases, which are 

5 heterodimers of a cyclin and kinase (such as cyclin B/p33 c dc2 y cyclin A/p33^DK2 f C y C iin 
E/p33 CDK2 and cyclin Dl/p33 CDK4 ). Other cell cycle control kinases include Weel 
kinase, Niml/Cdrl kinase, Wisl kinase and NIMA kinase. 

Protein-tyrosine specific kinases encompassed by the invention include: 1) members 
of the src family of kinases, including ppeQC-src, ppeo^^ Y<rs, Fgr, FYN, LYN, LCK, 

[) HCK, Dsrc64 and Dsrc28; 2) members of the Abl family of kinases, including Abl, ARG, 
Dash, Nabl and Fes/Fps; 3) members of the epidermal growth factor receptor (EGFR) family 
of kinases, including EGFR, v-Erb-B, NEU and DER; 4) members of the insulin receptor 
(INS.R) family of growth factors, including INS.R, IGF1R, DILR, Ros, 71ess, TRK and 
MET; 5) members of the platelet-derived growth factor receptor (PDGFR) family of kinases, 

5 including PDGFR, CSF1R, Kit and RET. 

Other protein kinases which can be used in the method of the invention include syk, 
ZAP70, Focal Adhesion Kinase, erkl,«rk2,«rk3, MEK, CSK, BTK, ITK, TEC, TEC-2, 
JAK-1, JAK-2, LET23, c-fins, S6 kinases (including p70 S6 and RSKs), TGF-p/activin 
receptor family kinases and Clk. 
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The amino acid sequence motifs determined by the method of the invention are useful 
for predicting whether a protein is a substrate for a particular protein kinase* The primary 
amino acid sequence of a known protein can be examined for the presence of the determined 
amino acid sequence motif. If the same or a very similar motif is present in the protein, it can 
5 be predicted that the protein could be a substrate for that protein kinase. 

The amino acid sequence motifs determined by the method of the invention are also 
useful for designing peptide substrates for protein kinases. Accordingly, the invention also 
provides substrates for protein kinases. These substrates are peptides (or proteins containing 
these peptides) having an amino acid sequence corresponding to the amino acid sequence 
10 motif of the phosphorylation site of the kinase determined by the method of the invention. 
Theoretically, a substrate for any protein kinase can be determined by the invention. The 
invention provides for a peptide substrate for a protein kinase comprising an amino acid 
sequence of the formula (SEQ ID NO:4): 

1 5 Xaal -Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9 

wherein Xaa5 is Ser, Thr or Tyr, and Xaal, Xaa2, Xaa3, Xaa4, Xaa6, Xaa7, Xaa8 and Xaa9 
are determined by the method of the invention. Thus, the protein kinase is contacted with an 
oriented degenerate peptide library comprising non-phosphorylated peptides comprising a 
20 formula (SEQ ID NO: 3): 

Met-Ala-Xaal-Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Ala-Lys-Lys-Lys 

wherein Xaal, Xaa2, Xaa3, Xaa4, Xaa6, Xaa7, Xaa8, and Xaa9 are any amino acid except an 
25 amino acid selected from the group consisting of Ser, Thr and Tyr and Xaa5 is selected from 
a group consisting of Ser, Thr and Tyr, under conditions which allow for phosphorylation of 
a substrate by the protein kinase. The protein kinase is allowed to phosphorylate peptides 
within the oriented degenerate peptide library having a phosphorylation site for the protein 
kinase to form a population of phosphorylated peptides. The population of phosphorylated 
30 peptides are separated from non-phosphorylated peptides by binding the population of 

phosphorylated peptides to a ferric column and the amino acid sequences of the population of 
phosphorylated peptides are determined. From the sequencing information, a relative 
abundance (RA) value for each amino acid residue at positions in the phosphorylated 
peptides corresponding to Xaal , Xaa2, Xaa3, Xaa4, Xaa6, Xaa7, Xaa8, and Xaa9 is 
35 determined. The relative abundance value is determined by the formula: 



R A = amount of Xaa in the population of phosphorylated peptides 
amount of Xaa in the oriented degenerate peptide library 
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The amino acid sequences of Xaal, Xaa2, Xaa3, Xaa4, Xaa6, Xaa7, Xaa8, and Xaa9 in the 
substrate for the protein kinase are determined by selecting amino acid residues that have a 
relative abundance value greater than 1 .0 at positions in the phosphorylated peptides 
corresponding to Xaal, Xaa2, Xaa3, Xaa4, Xaa6, Xaa7, Xaa8, and Xaa9. When the substrate 
is a substrate for a protein-serine/threonine kinase, Xaa5 is Ser or Thr, whereas when the 
substrate is a substrate for a protein-tyrosine kinase Xaa5 is Tyr. 

The invention further provides substrates for protein kinase A, cell cycle control 
kinases, src family kinases, EGF receptor, pp92 c " f P s/fes , lck, c-abl, PDGF receptor, FGF 
receptor, insulin receptor, casein kinase II, NIMA, phosphorylase kinase, Cam kinase II and 
Erk 1. These substrates are of the general formula (SEQ ID NO:5): 

Y,-Xaal-Xaa2-Xaa3-Xaa4.Xaa5-Xaa6.Xaa7-Xaa8-Xaa9-Y 2 

The length of a peptide which can be used as a substrate for a kinase is variable. For 
example, although the amino acid sequence motifs determined in this invention are 9 amino 
acids in length, a peptide which is shorter than 9 amino acids may be a substrate for a kinase. 
In certain cases, a peptide as short as 3 amino acids in length may be used as a substrate. 
Accordingly, Xaal, Xaal-Xaa2, Xaal-Xaa2-Xaa3, Xaa9, Xaa8-Xaa9 and Xaa7-Xaa8-Xaa9 
may or may not be present within the substrate. For example, the substrate may only be 
composed of Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8 or Xaa3-Xaa4-Xaa5-Xaa6-Xaa7 or 
Xaa4-Xaa5«Xaa6. If Xaal, Xaal-Xaa2 or Xaal-Xaa2-Xaa3 are absent, then Yj is hydrogen 
or an amino-derivative group. Similarly, if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, 
then Y2 is hydrogen or a carboxy-derivative group. 

Alternatively, the peptide substrate may be longer than 9 amino acids when Xaal- 
Xaa2-Xaa3 and or Xaa7-Xaa8-Xaa9 are present. When Xaa 1 -Xaa2-Xaa3 are present, Yj can 
be hydrogen, an amino-derivative group or a peptide of the formula (Xaa) a , wherein a is an 
integer from 1-15 inclusive. Likewise, when Xaa7-Xaa8-Xaa9 are present, Y 2 can be 
hydrogen, a carboxy-derivative group or a peptide of the formula (Xaa)^, wherein b is an 
integer from 1-15 inclusive. When a and b are integers from 1-15, this encompasses peptides 
up to a length of 39 amino acids. Alternatively, a and b can be integers from 1-10 inclusive. 
This encompasses peptides up to a length of 29 amino acids. Alternatively, a and b can be 
integers from 1-5 inclusive. This encompasses peptides up to a length of 19 amino acids. 

The sequences of Xaal, Xaa2, Xaa3, Xaa4, Xaa6, Xaa7, Xaa8 and Xaa9 within the 
substrates of the invention are chosen from the amino acid sequence motif for the particular 
kinase. 

Amino-derivative groups which can be present at the N-terminus of a peptide 
substrate (i.e., can be Yj) include epoxysuccinyl, cholesteryl, aryl, aralkyl and acyl 
derivative. Carboxy-<ierivative groups which can be present at the C-terminus of a peptide 
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substrate (i.e., can be Y 2 ) include alcohol, aldehyde, epoxysuccinate, acid halide. carbonyl, 
halomethane, and diazomethane derivatives. 

Amino acid sequence motifs for protein kinase A, cell cycle control kinases 
(including cyclin B/p33 cdc2 and cyclin A/p33 c °K2), src family kinases (including 
5 pp60 c - src and pp60 v -src), EGF receptor, pp92 c - fe s/*K Ick, c-abl, PDGF receptor, FGF 
receptor, insulin receptor, casein kinase II, NIMA, phosphorylase kinase, Cam kinase II and 
Erkl are provided by the invention. These motifs are described in detail in Examples 5-9 and 
11-12. 

Peptides composed of amino acid residues corresponding to one of the possible 

10 residues prescribed by the amino acid sequence motif of a kinase can be synthesized to create 
a peptide substrate for the kinase. Peptides composed of the most preferred amino acid 
residues of the motif (i.e, amino acid residues having the highest relative abundance at each 
degenerate position) can be synthesized as optimal peptide substrates for the kinase. The 
affinity of a peptide substrate for a kinase can be measured by standard techniques and the 

15 maximal rate of the kinase reaction (Vmax) and the concentration of peptide substrate at 
which the rate of the reaction is half maximal (Km) can be determined. For example, assays 
such as those described in Example 10 can be performed. As demonstrated in Example 10, 
optimal peptide substrates for cyclin B/pSS^ 2 , cyclin A/p33 CDK2 and ppeov-s 10 , 
synthesized based upon the amino acid sequence motifs of the invention, have the highest 

20 affinities yet reported for these kinases. 

It should be appreciated that it may be possible to make amino acid substitutions in a 
substrate peptide of the invention without substantially changing the affinity of the peptide 
for a kinase. For example, amino acid substitutions are likely to be possible at a position 
within the peptide which is relatively insensitive to variations in amino acid residues at that 

25 position. Similarly, amino acid substitutions are likely to be possible at a position which can 
broadly accommodate particular types of amino acids (e.g., hydrophobic or hydrophilic 
amino acids). The oriented degenerate peptide library used to determine an amino acid 
sequence motif of the invention represents peptides having every possible amino acid 
substitution at each degenerate position and the sequences determined from the library 

30 screening provide information regarding the effect of substituting a particular amino acid at 
each degenerate position surrounding a phosphorylation site. Thus, positions at which 
substitutions from the defined motif may be possible are readily identifiable from the data 
obtained from the sequenced phosphorylated peptides from the library (i.e, from the relative 
abundance values for each amino acid residue at each degenerate position). For example, a 

35 position which is relatively insensitive to variations in amino acid residues will have many 
amino acid residues at that position with relative abundance values around 1.0 (i.e, neither 
• substantially selected for or against in the library screening). While one or more amino acid 
residues at that position may have an RA greater than 1 .0 (the value set for inclusion in an 
amino acid sequence motif of the invention), it may be possible to substitute this residue with 
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an amino acid residue having an RA of 1.0 or slightly less than one at that position without 
substantially affecting the affinity of the peptide substrate for the kinase. For example, for 
src family kinases, the position at -4 relative to the phosphorylated amino acid (at position 0) 
can displays many amino acids with relative abundance values around 1.0 (see figure 3, box 
5 A), indicating that this position can accommodate many different amino acids. Likewise, a 
position which can broadly accommodate, for example, a hydrophobic residue, will have 
many hydrophobic residues at that position with RA values around 1.0, although only one or 
a few residues may have the preferred RA value of greater than 1.0. 

Accordingly, the invention encompasses a peptide substrate for a protein kinase 

1 0 having substantially equal affinity for the kinase as a peptide substrate comprising an amino 
acid sequence motif presented herein. The term "substantially equal affinity" is intended to 
mean that the peptide has an affinity for the kinase such that it can function in substantially 
the same way as a peptide having an amino acid sequence motif presented herein (e.g., the 
peptide can competitively inhibit the activity of a kinase against another substrate 

1 5 approximately equivalently to a peptide having an amino acid sequence motif presented 

herein). The affinity of a peptide for a kinase can be determined by standard techniques (such 
as described in Example 10) and can be quantitatively expressed as a Km value (i.e., the 
concentration of peptide at which the rate of the kinase reaction is half maximal). Preferably, 
a peptide having substantially equal affinity for a kinase as a peptide substrate comprising an 

20 amino acid sequence motif presented herein has a Km value no more than 2-fold greater than 
the Km of the peptide having the amino acid sequence motif. Alternatively, an affinity of a 
peptide for a kinase can be defined as a Kj value (i.e., the concentration of peptide needed to 
inhibit kinase activity toward another substrate by 50 %) by competition with other 
substrates. Preferably, a peptide having substantially equal affinity for a kinase as a peptide 

25 substrate comprising an amino acid sequence motif presented herein has a Kj value no more 
than 2-fold greater than the Kj of the peptide having the amino acid sequence motif. Km and 
Kj values for some peptide substrates having an amino acid sequence motif of the invention 
are presented in Example 10. 

The substrates of the invention can be contained within a larger protein by modifying 

30 a protein to contain the amino acid sequence of the phosphorylation site motif for a kinase. 
For example, a nucleotide sequence encoding a peptide corresponding in sequence to one of 
the possible amino acid sequences of the amino acid sequence motif can be deduced from the 
genetic code. (Because of the degeneracy of the genetic code, more than one nucleotide 
sequence may encode for the same peptide). A nucleic acid having this nucleotide sequence 

35 can be inserted into the coding sequence of a known protein by standard recombinant DNA 
technology. Alternatively, standard site-directed mutagenesis can be used to modify a protein 
to contain an amino acid sequence corresponding to a phosphorylation site for a kinase. 
Nucleotides within the coding region of a protein are mutated to nucleotides encoding a 
substrate of the invention. Alternatively, a fusion protein can be constructed, composed of a 
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protein fused to a substrate of the invention, by standard recombinant DNA technology. 
Accordingly, the invention provides proteins modified to contain the substrates of the 
invention. Because such a modified protein contains a phosphorylation site for a particular 
protein kinase, the protein can serve as substrates for that protein kinase. For example, a 
5 protein may be modified to contain a phosphorylation site for a protein kinase in order to * 
modulate a signal transduction pathway or to correct a defect in a mutant protein which 
normally is phosphorylated by the kinase but in which the mutation disrupts this 
phosphorylation. 

The invention also provides pseudosubstrates for PKA, cell cycle control kinases 
10 (including cyclin B/p33 cdc2 and cyclin A/p33 CDK2 ), src family kinases (including pp60 c - 
src and pp 60 v " src ), EGF receptor, p92 c - f ps/fes > lck, c-abl, PDGF receptor, FGF receptor, 
insulin receptor, casein kinase II, NIMA, phosphorylase kinase, Cam kinase II and Erkl . The 
amino acid sequences of these pseudosubstrates are based upon the amino acid sequence 
motif for the phosphorylation sites of these kinases. In the pseudosubstrates, the amino acid 

1 5 residue which is normally phosphorylated by the kinase is changed to a non-phosphorylatable 
amino acid residue. For serine/threonine kinase substrates, the preferred pseudosubstrate 
peptides contain alanine at the position which is normally serine or threonine in the substrate 
peptide. The structure of the side chain of alanine is identical to that of serine except that 
serine has a phosphorylatable hydroxy group where alanine has hydrogen. Thus, this change 

20 in the substrate peptide to create a pseudosubstrate peptide does not dramatically alter the 

chemical structure of the peptide (which could disrupt its ability to bind to the catalytic site of 
the kinase) but destroys its ability to be phosphorylated by the kinase. A native inhibitor of 
protein kinase A is known to contain alanine (rather than serine or threonine) within a 
consensus motif for the PKA phosphorylation site (see Cheng, H-C, (1985) Biochem. J. 

25 221:655-661 and Table 2 in Example 5). For the same reasons, the preferred pseudosubstrate 
peptides for protein-tyrosine kinases contain phenylalanine at the position normally occupied 
by tyrosine. The side chain of phenylalanine is identical to tyrosine except that tyrosine has a 
phosphofylatable hydroxy group where phenylalanine has hydrogen. Alternatively, for 
serine/threonine kinases, the phosphorylatable serine or threonine can be substituted with 

30 asparagine or glycine to mimic the serine or threonine residue, and for tyrosine kinases, the 
phosphorylatable tyrosine can be substituted with histidine to mimic the tyrosine residue. 

It is possible that other amino acids can be substituted for the phosphorylatable 
residue in the peptide substrate (i.e., the serine/threonine or tyrosine residue) to create a 
pseudosubstrate. The requirements are that the substituted residue is not phosphoiylatable by 

35 the particular kinase against which the pseudosubstrate is directed and that the 

pseudosubstrate can still bind to the catalytic site of the kinase. This can be determined by 
examining the ability of the pseudosubstrate to competitively inhibit the activity of the 
kinase. Standard techniques for assessing this are known in the art For example, a kinase 
can be incubated with a native substrate or peptide substrate and [y-^pj.^Tp m ^ 
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presence or absence of the pseudosubstrate under conditions which normally allow 
phosphorylation of the substrate by the kinase. Phosphorylation of the native or peptide 
substrate is measured by determining the transfer of radiolabeled phosphate to the substrate 
(i.e, by detecting radiolabeled substrate) in the presence and absence of the pseudosubstrate. 
5 A pseudosubstrate which binds to the catalytic site of a protein kinase will block the catalytic 
site of the kinase and inhibit transfer of radiolabeled phosphate to the substrate. An example 
of a competitive inhibition assay which can be used to evaluate pseudosubstrates is described 
in detail in Example 10. Known substrates for particular kinases can be used as substrates in 
this type of assay. For example, acid-denatured enolase can be used as a substrate for pp60 v * 
10 src (Kaplan, D., et al. (1986) Proc. Natl Acad ScL USA £2:3624-3628) or histone HI can be 
used as a substrate for p33 cdc2 (Atherton-Fessler, S., et al. (1993) Mol Cell Biol 11:1675- 
1685). 

A pseudosubstrate can be used to inhibit the activity of a kinase. It may be desirable 
to inhibit the activity of a kinase to modulate a signal transduction pathway. For example, 

1 5 certain oncogenic and/or viral forms of protein kinases have lost the ability to be down 

regulated and are therefore in a continually activated state. A pseudosubstrate could be used 
to downmodulate the activity of the kinase by blocking the catalytic site of the kinase, 
thereby preventing the kinase from phosphorylating native substrates. 

The amino acid sequence motifs provided by the invention can also be used to design 

20 peptide analogs of the substrates of the invention. It has generally been found that when 

peptides are added to intact cells or administered in vivo their uptake is inefficient and/or they 
are degraded. Thus, in situations in which one wants to modulate a signal transduction 
pathway (for example, for therapeutic purposes in vivo) it is desirable to use a peptide analog 
rather than a peptide. The term "peptide analog" as used herein is intended to include 

25 molecules which mimic the chemical structure of a peptide and retain biological properties of 
the peptide but which are cell membrane permeable. The term "peptide analog" is also 
intended to include molecules which mimic the chemical structure of a peptide and retain 
biological properties of the peptide but which are more stable <e.g., are less susceptible to 
degradation either in the circulation in vivo or intracellularly) than the peptide. In the case of 

30 peptide analogs of the peptide substrates of the invention, the peptide analog retains the 

biological property of being able to bind to the protein kinase (i.e., occupy the catalytic site of 
the protein kinase). Preferably, the peptide analog is not phosphorylated by the protein 
kinase (i.e., the peptide analog is an analog of a pseudosubstrate), although a 
phosphorylatable peptide analog may also be useful. Approaches to designing peptide 

35 analogs are known in the art. For example, see Farmer, P.S. in Drug Design (E.J. Aliens, ed.) 
Academic Press, New York, 1980, vol. 10, pp. 1 19-143; Ball. J.B. and Alewood, P.F. (1990) 
1 Mol Recognition 2:55; Morgan, B.A. and Gainor, J.A. (1989)i4nn. Rep. Med Chem. 
24:243; and Freidinger, R.M. (1989) Trends Pharmacol Scl lfl:270. 
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One approach to designing a peptide analog of the peptide substrates of the invention 
is to modify a protein kinase inhibitor to contain amino acid residues of the amino acid 
sequence motif for a particular kinase. For example, a number of small non-peptidic 
compounds have been shown to be competitive inhibitors of protein tyrosine kinases. For a 
5 review see Burke (1992) Drugs Fut 12:1 19-131. Some of these compounds are competitive 
with respect to the ATP binding site on the kinase, whereas others are competitive with 
respect to the phosphoacceptor site (i.e., the substrate binding site) on the kinase. 
Compounds which are competitive wi A respect to the substrate are thought to mimic the 
tyrosyl-containing substrate (Levitzki (1990) Biochem. Pharmacol 4&9\3-9\%\ Shiraishi et 

10 aL (1989) Cancer Res. 42:2374-2378; and Imoto et al. (1987) 1 Antibiot. 4Q: 147 1-1473). 

While some of these compounds display specificity for tyrosine kinases over serine/threonine 
kinases (Shiraishi et al. WO 88/07035 and Shiraishi et al. (1988) Chem. Pharm. Bull 2fi:974- 
981) and some degree of selectivity among protein tyrosine kinases (Yaish et al. (1988) 
Science 242:933-935; Gazit et al. (1989) J. Med. Chem. 22:2344-2352; Gazit et al. (1991) J. 

15 Med Chem. 24:1896-1907; Levitzki and Gilon (1991) Trends Pharmacol Sci. 12:171-174; 
Levitzki (1990) Biochem. Pharmacol 4Q:913-918; Shiraishi et al. (1989) Cancer Res. 
42:2374-2378), they are not highly specific for particular protein tyrosine kinases. The 
specificity of compounds which are competitive with the substrate binding site on tyrosine 
kinases may be increased by adding to these compounds amino acids found within a preferred 

20 phosphorylation site for a particular kinase (i.e., amino acids surrounding the phosphorylation 
site). 

For example, a class of styryl-containing compounds have been described which are 
competitive inhibitors of the substrate binding site of protein tyrosine kinases (see for 
example CA 109:210674t; CA 106:201757h; CA 106:201758j; CA 106:201762f; CA 

25 106;201763g; CA 107:96584v; CA 107:58668t; CA 106:213937e; CA 106:213918z; CA 
1 10:212392u; WO 88/07035; and Shiraishi et al. (1987) Biochem. Biophys. Res. Commuru 
142:322-328). These styryl-containing compounds include erbstatin (Nakamura et al. (1986) 
J. Antibiot. 22:314-315; and Isshiki et al. (1987) J. Antibiot. 40:1209-1210), piceatannol 
(Geahlen and McLaughlin (1989) Biochem. Biophys. Res. Commun. 165:241-245) and 

30 tyrphostins (EP 0 322 738 A2; Yaish et al. (1988) Science 242:933-935; Gazit et al. (1 989) J. 
Med Chem. 22:2344-2352; Gazit et al. (1991) J. Med Chem. 24:1896-1907; Levitzki and 
Gilon (1991) Trends Pharmacol ScL 12:171-174; Levitzki (1990) Biochem. Pharmacol 
40:913-918). Amino acid residues located N-teraiinal and C-terminal to the phosphorylated 
tyrosine within an amino acid sequence motif of a kinase (e.g., at the -1 and +1 positions 

35 relative to a phosphorylated tyrosine at position 0) could be added to these compounds to 
increase the specificity of the compound for a particular kinase. 

Another type of peptide analog of the peptide substrates of the invention which can be 
made is a peptide which has methylated amide linkages. This modification of the peptide has 
been found to increase the membrane permeability of the peptide and increase its stability. 



WO 95/18823 



-25- 



PCTAJS95/00147 



Additionally, the rigidity of the peptide may be increased, which may increase the affinity of 
the peptide analog for the catalytic site of the kinase relative to the unmodified peptide 
substrate. 

5 Industrial Applications 

The peptide substrates of the invention are useful for detecting and quantitating the 
activity of protein kinases in vitro. For example, a solution can be contacted with a substrate 
of the invention to detect the presence of a particular kinase in the solution. For example, the 
solution can be a cell extract or a column fraction of a partially purified kinase. The peptides 

10 substrates can also be used to determine the effect of various agents (e.g, a pharmaceutical 
compound) on the activity of a kinase by examining what effect the agent has on the 
phosphorylation of the peptide substrate by the kinase. For in vitro assays, the peptide 
substrate can be immobilized on a solid support (e.g., a bead) to allow for easy separation of 
the peptide from other reaction components (e.g., [y-32p]-ATP). With regard to these in vitro 

15 uses pertaining to monitoring the presence and activity of kinases, it should be noted that the 
affinity of certain peptide substrates of the invention for particular kinases is higher than has 
been previously observed for any native or synthetic peptide substrate, including 
commercially available peptide substrates marketed for these uses (see Example 10). 

The peptide substrates of the invention are also useful for raising antibodies against 

20 these peptides. Such antibodies would be specific for preferred phosphorylation sites of a 
protein kinases. These antibodies could then be used to identify native substrates for the 
protein kinase, for example by using the antibodies to immunoprecipitate proteins from cell 
extracts. Proteins immunoprecipitated by the antibodies would contain a preferred 
phosphorylation site for the kinase and therefore would be candidates for substrates of the 

25 kinase. (The ability of a protein so identified to serve as substrate for the kinase could then 
be tested in vitro by standard techniques). Considering that a limiting factor in identifying 
native substrates for certain protein kinase is their low abundance in cells, antibodies directed 
against preferred phosphorylation sites for a kinase may offer a means by which to purify low 
abundance substrates. 

30 The peptide substrates of the invention can also be used to competitively inhibit the 

activity of a protein kinase. Because of their high affinity for a protein kinase (i.e., higher 
than the previously reported affinities of other substrates for the kinase), these peptides are 
likely to be very efficient competitive inhibitors. The ability of a peptide substrate of the 
invention to competitively inhibit the activity of a kinase is demonstrated in Example 10. 

35 The pseudosubstrates and peptide analogs of the invention can also be used to inhibit the 

activity of a kinase since these compounds retain the ability to bind to the catalytic site of the 
kinase. The invention provides a method for inhibiting phosphorylation of a native substrate 
by a kinase by contacting the kinase with an inhibitory amount of a peptide substrate of the 
invention to competitively inhibit phosphorylation of the native substrate by the kinase. 
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Alternatively, the kinase can be contacted with an inhibitory amount of a pseudosnbstrate or 
peptide analog of the invention. 

Protein kinases play a role in virtually all signal transduction pathways, including 
those involved in cell cycle control, immunological responses, transcriptional activation and 
cell development. Moreover, protein kinases play a role in cell transformation. The abiKty to 
inhibit the activity of protein kinases thus provides a means by which to modulate a wide 
variety of cellular responses. Novel therapeutic agents composed of or based upon the 
peptide substrates, pseudosubstrates and peptide analogs of the invention may thus have 
applications in a wide variety of disease situations. 

This invention is further illustrated by the following examples which should not be 
construed as limiting. The contents of all references and published patents and patent 
applications cited throughout the application are hereby incorporated by reference. 

15 E XAMPLE 1 ; Construction of an Oriented Degenerate Peptide Library 

A soluble synthetic peptide library was synthesized by N«-Fmoc-based synthesis 
conducted on a Milligen/Biosearch 9600 synthesizer using standard (benzotriazolyloxy)tris- 
(dimemylainmo>phosphomum he 

20 coupling protocols and protected amino acids. For degenerate positions, the resin was 

deprotected with 30 % piperidine as usual and washed extensively with dimethylformamide. 
The solvent was removed and the moist resin is divided into 18 equivalent amounts by 
weight. Each aliquot was coupled for 1 8 hours with 4 equivalents each of BOP, 1- 
hydroxybenzo-triazole, and a different N«-Fmoc-amino acid having the appropriate side 

25 chain protecting group. All aliquots were recombined and the procedure was repeated at each 
degenerate position. Peptide cleavage and side chain deprotections were with trimethylsily] 
bromide as described (Domchek, S, et al. (1993) Biochem. 21:9865-9870). The peptide 
mixture was precipitated with diethyl ether (4° C) and desalted on a column of Bio-Gel P2. 
The following peptide library was constructed (SEQ ID NO-3V 

30 

Met-Ala-Xaa-Xaa-Xaa-Xaa-Ser-Xaa-Xaa-Xaa-Xaa-Ala-Lys-Lys-Lys 

where Xaa indicates all amino acids except Trp, Cys, Tyr, Ser, or Thr. A similar oriented 
degenerate peptide library was synthesized with Tyr at position 7. The Tip and Cys were 
35 omitted to eliminate problems with sequence detection and oxidation and Tyr, Ser and Thr 
were omitted from the degenerate positions to insure that the only potential site of 
phosphorylation is the residue at position 7. The total theoretical degeneracy of this library is 
1 5» = 2,562,890,625. The 4 degenerate residues N-tenninal and C-terminal of the site of 
phosphorylation cover the region most likely to be involved in catalytic site recognition on 
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the basis of the motifs that have been determined for protein kinases to date. The Met-Ala at 
the N-terminus provides two amino acids to verify that peptides from this mixture are being 
sequenced and a quantification of the peptides present. Similarly, the Ala at residue 12 
provides a second quantification and an estimate of how much loss of peptide has occurred 
5 during sequencing. The poly-Lys tail prevents wash-out during sequencing and improves the 
solubility of the mixture (no solubility problems have occurred at neutral pH and 3 mg/ml). 
The peptide mixture was sequenced to insure that all 15 amino acids were present at similar 
amounts at all 8 degenerate positions. 

10 EXAMPLE 2: Phosphorylation of a Peptide Library by a Protein Kinase 

The degenerate peptide library constructed as described in Example 1, with either 
serine or tyrosine at the 7th position, was used as a substrate for several different protein 
kinases. Protein kinase A (bovine) was obtained commercially (Sigma). GST-cyclin 

15 B/p33cdc2 (Atherton-Fessler et al. (1993) Mol CellBiol 11:1675-1685), GST- 

cyclinA/p33 CDK 2 (Atherton-Fessler et al. (1993) Mol CellBiol 12:1675-1685), human 
EGF receptor, chicken pp60 v ' src , polyoma virus middle T antigen/ppeO 0 ' 5 * 0 (Piwnica- 
Wonns et al. (1990) J. Virol M:61-68) and pp92 c -*P s/fes were expressed by baculoviruses. 
The baculovirus-expressed protein kinases were immobilized on protein A beads using 

20 precipitating antibodies or on glutathione-agarose bead for kinases obtained as GST fusion 
proteins to minimize the problem of separating the peptides from the enzyme or impurities in 
the enzyme. A monoclonal antibody against pp60 v " src (Ab-1) was commercially obtained 
(Oncogene Science). The EGF receptor monoclonal antibody (13A9) was previously 
described. The fps/fes-specific regressing-tumor antiserum was previously described 

25 (Feldman et al, (1987) Oncogene Res. 1:441-458). In the case of PKA, the reaction was 
done in solution and the peptides were separated from the kinase through a spin -column 
(Centricon 10). 

In a typical reaction, the protein kinase was added to 300 ^1 of solution containing 1 
mg of degenerate peptide mixture, 100 *iM ATP with a trace of [y-^PJ-ATP (roughly 6 x 10 5 

30 cpm), 1 mM DTT, 10 mM MgCl 2 or MnCl 2 and buffered to pH 7.0 with Tris. The 

concentration of peptides in the reaction was approximately 1.8 mM. The reaction mixture 
was incubated at 25°. It is preferable that at least 0.5 % to 1 % of the peptides in the library 
are phosphorylated during the kinase reaction (for reasons discussed in Example 4) and 
reaction times were chosen accordingly to achieve this result. Incubation times in the range 

35 of 254 hours were used. 
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EXAMPLE 3; Separation of Phosphorylated Peptides from 

Non-Pbosphorylated Peptides and Peptide Sequencing 

After a kinase reaction was performed as described in Example 2, the peptide 
5 supernatant was removed and diluted with 300 \il of 30 % acetic acid. This mixture was then 
added to a 1 ml DEAE column previously equilibrated with 30 % acetic acid and the column 
was eluted with 30 % acetic acid (Kemp et al, (1976) Proc. Natl Acad Set USA 22:1038- 
1042). In initial experiments, the fractions from the column were analyzed for peptide, 
phosphopeptide, [y- 32 P]-ATP, and 32 P0 4 by phosphocellulose paper (P8 1), TLC or SDS 

10 PAGE. It was determined that after the 600 \il void volume, the next 1 ml contained both 
phosphorylated and non-phosphorylated peptides but was free of [y^PJ-ATP. This fraction 
was collected and lyophilized. Since the peptide fraction was free of [y- 32 P]-ATP, the 
radioactivity in this fraction provided an initial estimate of the fraction of the total peptide 
mixture that was phosphorylated. 

1 5 A ferric chelation column (IDA beads, Pierce) was used for separation of 

phosphopeptides. This column has been used in the past to separate tryptic phosphopeptides 
of phosphorylated proteins from the bulk of non-phosphorylated tryptic peptides (Muszynska 
et al., (1986) Biochem. 21: 6850-6853; Muszynska et al., (1992) J. Chromatography QM:19- 
28). However, we discovered that in order to accomplish a quantitative removal of the 

20 non-phosphorylated peptides from the phosphopeptides in the mixture without loss of a 

subfraction of phosphopeptides, it was necessary to change the loading and elution conditions 
from published procedures. A 1 ml column of IDA beads (Pierce) was charged with 5 ml of 
20 mM ferric chloride at 1 ml/2 min, washed with 8 ml of water at 1 ml/min, washed with 6 
ml of 500 mM NH4CO3 pH 8.0 at 1 ml/min, washed again with 6 ml of water and then 

25 equilibrated with 6 ml of buffer A (50 mM MES, 1 M NaCl, pH 5.5). The dried sample of 
peptide/phosphopeptide mixture was dissolved in 200 \il of buffer A and loaded onto the 
ferric column. The column was then eluted with 5 ml of buffer A followed by 5 ml of buffer 
B (2 mM MES, pH 6.0). The phosphopeptides were then eluted with 4 ml of 500 mM 
NH4CO3, pH 8.0. Finally, the Fe3+ was eluted with 100 mM EDTA pH 8.0. All elutions 

30 were at 0.5 ml/min. and 0.5 ml fractions were collected. The amount of peptide at each 
fraction was estimated by absorbency at UV 280nm (fractions 1-10) and the amount of the 
phosphorylated peptides present was estimated by the incorporated radioactivity. Less than 
0.1 % of the total non-phosphorylated peptides eluted at fraction 1 1-14 as judged by the 
presence of a non-phosphorylated residue at cycle 7 in the sequence of this mixture. Greater 

35 than 90 % of the radioactivity applied eluted in fractions 1 1-14. A graph depicting the 
elution profile of peptides from the ferric column is shown in Figure 1 . 

The phosphopeptides eluted from the column were dried down to a small volume and 
sequenced using an Applied Biosystems 477A Protein Sequencer. 
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In a reaction in which 1% of the peptide mixture is phosphorylated, the total quantity 
of phosphopeptides is (1.8 mM) x (0.3ml) x (0.01) = 5.4 nmoles. Typically, about 1-2 nmoles 
5 of phosphopeptide mixture is added to the automated sequencer. This means that in a cycle in 
which all 15 residues are equally abundant, the yield of each amino acid is (1 nmol) x (1/15) 
= 60 pmoles. The abundance of each amino acid at a given cycle in the sequence of the 
phosphopeptide mixture was divided by the abundance of the same amino acid at the same 
cycle of the starting mixture to determine the relative abundance of each amino acid residue 

10 at a given cycle (i.e, corresponding to a given degenerate position in the peptide). In this 
way, variations in the abundance of amino acids at particular residue (i.e. position 3, the first 
degenerate residue) in the starting mixture or variations in yield of amino acids in the 
sequencer are divided out. If the kinase is insensitive to amino acid variations at a degenerate 
position, then the relative abundance of each amino acid at this cycle in the phosphopeptide 

1 5 mixture will be the same as in the starting mixture (i.e., the relative abundance of each amino 
acid will be 1 .0). If an amino acid residue is preferred at this particular position, it will have 
a relative abundance greater than 1 .0 whereas if the amino acid residue is selected against at 
this position it will have a relative abundance less than 1 .0. The data can be plotted in the 
form of a bar graph (see figures 2 and 3) to visually depict the relative abundance of each 

20 amino acid residue. 

In analyzing the sequencing data, two complexities must be considered. The first 
problem is that contamination from non-phosphorylated peptides that may elute from the 
ferric column together with the phosphorylated peptides will complicate the results. This 
problem has been assessed in two ways. First, a mock reaction in which the same procedure 

25 is followed but the protein kinase is omitted was performed. The fractions from the ferric 
column that normally contain the phosphopeptides (i.e., 1 1-14) are pooled, lyophilized and 
sequenced. The yield of total peptides (judged by quantity of Ala at cycle 2) in this pool is an 
indication of how much contaminating non-phosphorylated peptides are present in the kinase 
experiment. Typically, the yield in this mock experiment is about 0.1 % of the total starting 

30 peptide mixture. Upon sequencing, the column fractions containing the contaminating non- 
phosphorylated peptide were found to be enriched in Asp and GIu at every degenerate cycle, 
presumably due to interaction of these residues with theTe3+. Because of this 0.1 % of 
contaminating non-phosphorylated peptides, it is necessary to phosphorylate at least 0.2 % of 
the peptide mixture in order for the phosphopeptides to be more abundant than the 

35 non-phosphorylated peptides in the mixture pooled. Preferably, at least 0.5 % to 1 % of the 
peptide mixture is phosphorylated. 

The second procedure used to estimate non-phosphorylated peptides in the 
phosphopeptide mixture is to note the quantity of non-phosphorylated Ser at cycle 7 and 
compare it to the quantity of Ala at cycle 2. In the starting mixture the ratio of Ser at cycle 7 
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to Ala at cycle 2 is one. The actual measured ratio in the microsequencer is less than one 
because of the fall-off in yield with each cycle and the low recovery of Ser. However, the 
measured ratio for the starting material is reproducible. Since the phosphorylated peptides 
should have no non-phosphorylated Ser at cycle 7. the ratio of Ser at cycle 7 to Ala at cycle 2 
5 ought to be zero. Any value above zero can be assumed to be due to contamination with non- 
phosphorylated peptide and using the ratio calculated for the starting material, one can 
estimate the amount of contamination. In most experiments, this calculation was in good 
agreement with the contamination estimated from the mock experiment In those 
experiments where only approximately 0.2 % of the mixture was phosphorylated as judged 

1 0 by radioactivity, the ratio of Ser at cycle 7 to Ala at cycle 2 in the phosphopeptide mixture 
was relatively high (about 30% of the ratio in the starting material) indicating significant 
contamination. When higher yields were obtained, the ratio was lower. 

In some experiments where only about 0.5 % of the total mixture was phosphorylated, 
a correction was made for the 0.1 % contaminating nonphosphorylated peptides. A control 

1 5 mock reaction, as described above was performed and the amino acid abundance at each 
cycle from this control experiment was subtracted from the kinase experiment to correct for 
the background. To calculate the relative abundance of amino acids at each degenerate 
position, the corrected data were then compared to the starting mixture to create the ratios of 
abundance of amino acids. The sum of the abundance of each amino acid at a given cycle 

20 was normalized to 15 (the number of amino acids present) so that each amino acid would 
have a value of 1 in the absence of selectivity at a particular position. 

Ultimately, confidence in this procedure is provided by the reproducibility of the 
results obtained with a given kinase and by differences observed when different kinases are 
used to phosphorylate the same peptide library. 

25 

EXAMPLE 5: Determination of an Amino Acid Sequence Motif for the 

Phosphorylation Site of Protein Kinase A 

Since the peptide substrate specificity of protein kinase A (PKA) has been extensively 
30 studied, the oriented degenerate peptide library was first tested with this enzyme. The 
peptide library described in Example 1 in which Ser was fixed a position 7 was used. The 
library was incubated with PKA (catalytic subunit, from Sigma) under conditions were 
approximately 1 % of the total mixture was phosphorylated (as described in Example 2). The 
phosphopeptide products were then separated from the bulk of the non-phosphorylated 
35 peptides of the library using the ferric-IDA column and the mixture was sequenced (as 
described in Example 3). 

By comparing the sequence of the phosphopeptide products to that of the complete 
(i.e., starting) library, it was possible to determine which residues N-terminal and C-terminal 
to the phosphorylated serine were preferred for substrate recognition by calculating relative 
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abundance values for each amino acid residue at each degenerate position. The results arc 
presented as bar graphs in Figure 2. Box A represents the relative abundance of amino acid 
residues at position 3 in the mixture of phosphopeptides products. This is the first degenerate 
position of the library and is 4 residues N-terminal of the phosphorylation site (position 7). 
5 In general, Arg was preferred at each position N-terminal of the phosphorylated Ser (see 
boxes A, B, C and D) while hydrophobic residues were preferred at positions C-terminal of 
the Ser (see boxes E, F, G and H). The greatest selectivity was observed at residues -3 and -2 
relative to the Ser (boxes B and C): more than 60 % of the phosphopeptides have Arg at these 
positions. At the -3 position (relative to the Ser), His is the second best amino acid while at 

10 the -2 position (relative to the Ser), Lys is the second best. The least selectivity was observed 
at positions +2 and +3 relative to the Ser (boxes F and G). 

The amino acid residues at degenerate positions having a relative abundance value 
greater than 1 .0 are summarized below in Table 1 (the relative abundance value is in 
parentheses next to the amino acid residue and positions are indicated relative to the 

1 5 phosphorylation site at 0). More preferred amino acid residues within the motif have relative 
abundance values equal to or greater than 1 .5 and are highlighted in bold-face type: 



Table 1 : Preferred amino acid residues at positions surrounding the phosphorylation site of 
protein kinase A (SEQ ID NO:6) 



=4 


=2 


=2 


zl 


Q 


±1 


±2 


±2 


±4 


R(7.23) 


R (9.97) 


R (9.62) 


R(7.09) 


s 


I (2.49) 


1(1.81) 


F(1.62) 


1(3.74) 


H(233) 


H(1.87) 


K (1.66) 


H(l.jS2) 


(T) 


M (1.86) 


V(1.74) 


1(1.56) 


L(1.43) 






H(1.07) 






F(1.85) 


F(1.58) 


M(1.55) 


F(1.23) 












L(1.75) 




V(1.39) 


M(1.22) 












V(L70) 




L(I,30) 


V(1.18) 



20 

The preferred amino acid residues at positions surrounding the phosphorylation site of 
protein kinase A represent the amino acid sequence motif for the phosphorylation site as 
determined by the library screening. This amino acid sequence motif can be compared to 
25 known in vitro and in vivo peptide substrates and inhibitors for PKA. Such an analysis is 
shown below in Table 2: 
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Table 2: Comparison of the amino acid sequence motif determined by the peptide library to 
sequences at the same region of known substrates and i nhibitors of PKA* 

Peptidg/Prctein Sequence 

-4-3-2-1 0 +1+2+3+4 

5 

PKA substrate amino acid sequence motif RRRR S IIFI 
(SEQ ID NO: 6) H H K H T M V I L 

H F F M F 

L V M 

10 V L V 

Walsh inhibitor sequence in 

pseudo-substrate domain: GRRN A IHDI (SEQ ID NO: 15) 

Kemptide sequence L R R A S L G (SEQ ID NO: 16) 

1 5 Optimal peptide substrate based on 
fragment of Walsh inhibitor 

(40 fold lower Km than Kemptide) GRTGRRN S I (SEQ ID NO: 17) 

Phosphorylase kinase, a-subunit FRRL S ISTE (SEQ ID NO: 18) 

Phosphorylase kinase, b-subunit KRSG S VYEP (SEQ ID NO: 19) 

20 Glycogen synthase, site la PRRA s CTSS (SEQIDNO:20) 

Glycogen synthase, site lb KRSN S VDTS (SEQIDNO:21) 

Glycogen synthase, site 2 SRTLSVSSL (SEQIDNO:22) 

Pyruvate kinase LRRA S LAQL (SEQIDNO:23) 
6-Phosphofhicto-2-kinase/ 

25 fructose-2,6-bisphosphatase RRRG S SIPQ (SEQIDNO:24) 

Protein phosphatase inhibitor I RRRP T PATL (SEQIDNO:25) 

PP^O 5 " 5 ORRR s LEPA (SEQIDNO:26) 

* Substrate and inhibitor sequences are from Taylor et aL, (1990) Ann. Rev. Biochem. 
30 52:971-1005 and Walsh et aL, (1990) Peptides and Protein Phosphorylation (B.E. Kemp, ed.) 
CRC Press Inc. pp.43-84. 

Two observations can be made from this analysis. First, the most optimal substrates 
predicted from the library is in good agreement with the sequences of known substrates and 

35 inhibitors for PKA, supporting the validity of this technique for determining the substrate 
specificity of a kinase. Second, differences do exist between the preferred amino acid 
sequences selected by peptide library screening and the amino acid sequences of known 
substrates and inhibitors. Amino acid residues in the substrates and inhibitors listed in Table 
2 which differ from the determined amino acid sequence motif for PKA are indicated in bold 

40 type. 
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EXAMP LE fc 



Determinati n fAmin Acid Sequence Motifs 
for Cell Cycle Control Kinases 



10 



The substrate specificities of two other protein-serine/threonine kinases were 
examined using the serine-containing oriented degenerate peptide library as described for 
PKA in Example 5. The two kinases examined, cyclin B/p33 cc k2 and cyclin A/p33^K2, 
are kinases involved in cell cycle control. The results are presented as bar graphs in Figure 3 
(for cyclin B/^™* 02 ) and Figure 4 (for cyclin A/p33C DK2 ). 

The results of the library screening for these two kinases are summarized below in 
Tables 3 and 4, which present the amino acid residues at degenerate positions having a 
relative abundance value equal to or greater than 1 .0. More preferred amino acid residues 
within the motif have relative abundance values equal to or greater than 1 .5 and are 
highlighted in bold-face type: 



1 5 Table 3: Preferred amino acid residues at positions surrounding the phosphorylation site of 
cyclin B/p34 cd c2 (SEQ ID NO:8) 



4 


=1 


:2 


rl 


a 


±1 


±2 


±1 


±4 


R (2.72) 


R(2.50) 


P (2.16) 


M (1.67) 


S 


P (11.07) 


P(3.17) 


K(2.55) 


K (1.90) 


H(1.31) 


P(1.44) 


M(1.5) 


Q039) 


(T) 




K(1.68) 


R(1.85) 


I (1.86) 




M(1.40) 




A (1.26) 






M(1.18) 


P(1.38) 


R(1.45) 




V(1.33) 




N(1.23) 






A (1.07) 








K132) 




R(1.18) 






V(1.10) 












K(1.12) 













Table 4: Preferred amino acid residues at positions surrounding the phosphorylation site of 
cyclin A/p34CDK2 ( SE q ro NO:9 ) 



=1 



r2 



zl 



±1 



±2 



+4 



R (2.89) 



R(2.60) 



PQ.89) 



M (1.70) 



P(1L50) 



P (2.14) 



K(1.88) 



K(1.67) 



H (1.66) 



H (2.12) 



M (1.54) 



Q(l.S) 



(D 



M (1.58) 



R(L70) 



N (1.30) 



N(1.32) 



V(1.18) 



L(138) 



R(1.32) 



R(131) 



Qd.ll) 



P(M3) 



N(1.32) 



V(1.15) 



M(l.ll) 



A (121) 



K(l.ll) 



20 



25 



Although these two kinases are associated with distinct cyclins and are turned on at 
different stages of the cell cycle, the peptide substrates that they preferred for 
phosphorylation were nearly identical. A composite amino acid sequence motif, combining 
the two motifs for the two cell cycle control kinases, can be created having the sequence 
(SEQ ID NO: 7): 



WO 95/18823 



-34- 



PCT/US95/00147 



R-R- 


-P- 


-M- 


■S- 


p.p. 


■K- 


-K 


H H 


M 


Q 


T 


K 


R 


R 


N P 


L 


A 




M 


P 


I 


Q M 




N 




A 




N 


V 




R 




V 






I 




K 




R 







This composite amino acid sequence motif was compared to the amino acid sequences 
of known substrates for these cell cycle control kinases. This analysis is shown below in 
10 Table 5. Amino acid residues in known substrates of p33 C( * c 2 or p33^DK2 which differ 
from those in the determined amino acid sequence motif are shown in bold type. 



15 



20 



Table 5: Comparison of the amino acid sequence motif determined by the peptide library to 
sequences at the same region of known substrates of P 33c dc2 an H p ^CDK2* 



r.epii ue/ r ro lein 










Seauence 




-4 


-3 


-2 


-1 


n 
u 


-1-1 4-0 -1-1 -t-A 
' 1 


p33cacz and p 33CUKZ motif 


R 


R 


P 


M 


e 
0 


D D V V 

C IT JV JV 


o&y J.JJ vi\j t / j 


H 


TT 

H 


M 


Q 


T 


K R R 




N 


P 


L 


A 




M P I 




Q 


M 




N 




A N 






V 




R 




V 






I 




K 




R 


Known substrates 














Xenopus histone HI 


V 


A 


A 


K 


S 


P A K A (SEQIDNO:27) 




K 


A 


A 


K 


S 


P K K T (SEQE>NO:28) 


Chicken Lamin B2 


6 


T 


P 


L 


s 


P T R I (SEQIDNO:29) 


ppeO 0 "^ 


D 


T 


H 


R 


T 


P S R S (SEQIDNO:30) 


SV40LT Antigen 


S 


Q 


H 


S 


T 


P P K K (SEQIDNO-.31) 


Yeast SWI5 


Y 


T 


T 


N 


S 


P S K I (SEQIDNO:32) 




V 


I 


K 


R 


S 


P R K R (SEQIDNO:33) 


Predicted Sites 












Protein Phosphatase (T3 1 6) 


6 


R 


P 


I 


T 


P P R K (SEQIDNO:34) 


Strathmin 




P 


P 


L 


S 


P P K K (SEQID NO:35) 


HIV Enhancer-Binding Protein 2 


R 


R 


P 


V 


S 


P O K D (SEQIDNO:36) 


S6 Kinase II Alpha (S335) 


R 


R 


E 


M 


S 


P P P K (SEQIDNO:37) 


Elongation Factor 1 -gamma (216) 


D 


K 


P 


L 


S 


P P Q K (SEQIDNO:38) 


Histone H5 (SI 33) 


R 


K 


A 


R 


s 


PARK (SEQIDNO:39) 


Cell Division FTSZ protein (S548) 


R 


R 


P 


L 


s 


PEAS (SEQIDNO:40) 


Ubiquitin-Conjugating Enzyme 


D 


y 


P 


L 


s 


P P K L (SEQIDNO:41) 



* Substrate sequences are from Nigg, E.A. (1991) Seminars in Cell Biology 2:261-270. 
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EXAMPLE 7: Determinati n of Amino Acid Sequence Motifs 

for src Family Kinases 

The version of the oriented degenerate peptide library with tyrosine at position 7 was 
5 used to evaluate the substrate specificities of two protein-tyrosine kinases which are members 
of the src family of protein-tyrosine kinases. The src family kinases pp60 v " src and polyoma 
virus middle T/pp60 c " src were examined and bar graphs summarizing the results of the 
library screening are shown in Figure 5. The results of the library screening are also 
summarized below in Tables 6 and 7 and , which presents the amino acid residues at 
10 degenerate positions having a relative abundance value equal to or greater than 1.0. More 
preferred amino acid residues within the motif have relative abundance values equal to or 
greater than 1.5 and are highlighted in bold-face type: 

Table 6: Preferred amino acid residues at positions surrounding the phosphorylation site of 
15 ppepc-src^EQjpNQ.n) 



z4 


r2 


z2 


^1 


Q 


±1 


±2 


±a 


±4 


D(1.45) 


E (2.51) 


£ (1.96) 


I (3.04) 


Y 


G(2.24) 


£ (1.66) 


F(2.87) 


F(1.53) 


E(1.32) 


D(1.88) 


D (1.68) 


V(238) 




E(2.06) 




I (1.52) 


E(1.38) 






G (1.5) 


L(1.52) 








L(1.43) 










P(1.17) 








M(1.02) 





Table 7: Preferred amino acid residues at positions surrounding the phosphorylation site of 
p P 6QV-src (SEQ ID NO: 12) 



=4 


=a 


=2 


=1 


Q 


±1 


±2 


±2 


±1 


E (1.66) 


E(3.18) 


E (2.24) 


I (4.16) 


Y 


G (2.63) 


E (2.51) 


F(4.00) 


D(120) 


D(1.5) 


D(2J23) 


D (1.98) 


V(3.14) 




E (2.52) 


D(1.40) 


1(1.81) 


E(2.10) 






G (1.62) 


L(1.63) 




D (1.58) 




L(1.70) 








N(1.43) 


P(1.08) 








V(1.60) 


















M(1.17) 





20 The peptide substrates that are preferred for phosphorylation by these two members of 

the src family of tyrosine kinases were nearly identical. A composite amino acid sequence 
motif, combining the two motifs for the src family kinases, can be created having the 
sequence (SEQ ID NO: 10): E-E-E-I-Y-G-E-F-E 

DDDV EDIF 
25 G L D L D 

N P V 
M 



WO 95/18823 



-36- 



PCT/US95/00147 



10 



The sequences preferred by pp60C-src an d pp 60V-src sug gest that the catalytic site am* 
SH2 domain provide double selection for downstream signaling. The selection for Glu at 
positions -3 and -2 N-terminal of the phosphorylation site is in agreement with previous 
studies (Hunter et al. J. Biol. Ctem._2IZ:4843; Patchinsky et al. Proc. Natl. Acad. Sci. 
22:973; and Cooper et al. J. Biol. Chem. 252:7835). The surprising result is that He or Val at 
the -1 position is even more important than Glu at -2 or -3 in determining selectivity. An 
even more interesting surprise is that the preferred sequence C-terminal of the 
phosphotyrosine (PhosphoTyr-Gly/Glu-Glu-Phe/Ile; (SEQ ID NO: 42)) is very similar to the 
optimal sequence for binding to the SID domain of src family members (PhosphoTyr-Glu- 
Glu-Ile;(SEQ ID NO: 43); Songyang, Z., et al (1993) Cell 72:767-778). Thus, the active site 
of ppeO/v-src p re fe rs to phosphorylate substrates that its own SH2 domain can bind. 



This composite amino acid sequence motif was compared to the amino acid sequences 
of known substrates for these tyrosine kinases. This analysis is shown below in Table 8. 
1 5 Amino acid residues in known substrates of ppeoc-src or pp60 v " src which differ from those 
in the determined amino acid sequence motif are shown in bold type. 



20 



Table 8: Comparison of the amino acid sequence motif determined by the peptide library to 
sequences at the same region of known substrates of pp60 c_src and pp60 v_src * 



Pentide/Pmtetn 



Sequence 



-4-3-2-1 0 +1+2+3+4 



25 



30 



35 



40 



pp60C-src ^ pp 6ov-src mot if 
(SEQ ID NO: 10) 



Some good in vitro substrates. 
Gastrin 
Enolase (Y43) 
p34«fc2 peptide (Yl 5) 

in vhn substrata 

Mouse Polyoma middle t (Y3 1 5) 

Hamster Polyoma middle t (Y295) 

Hamster Polyoma middle t (Y321) 

Ptdlns 3-kinase p85 subunit (Y607) 

pp60v-src (Y416) Autophosphorylatio 

pp60v-src (Y92) Autophosphorylation 

pl25FAK(Y373)Phos. 

T-cell receptor £(Y1 10) 

T-cell receptor J; ryi ?7) 



E E E I 

D D D V 

G L 

N P 



E E A 
TGI 
E G T 



G E F E 
E D I F 
D L D 

V 

M 



Y G W M D (SEQ ID NO: 44) 

Y E A L E (SEQ ID NO: 45) 

Y G V V y (SEQ ID NO: 46) 



E 


E 


B 


y 


M 


P 


M E (SEQ ID NO: 47) 


E 


N 


E 


Y 


M 


P 


M A (SEQ ID NO: 48) 


E 


P 


Q 


y 


E 


E 


I P (SEQ ID NO: 49) 


E 


D 


Q 


Y 


S 


L 


V E (SEQ ID NO: 50) 


D 


N 


E 


Y 


T 


A 


R Q (SEQ ID NO: 51) 


L 


Y 


D 


Y 


E 


S 


W I (SEQ ID NO: 52) 


T 


D 


D 


Y 


A 


E 


I D (SEQ ID NO: 53) 


E 


G 


L 


Y 


N 


E 


L Q (SEQ ID NO: 54) 


A 


E 


A 


Y 


S 


E 


-L6 (SEQ TP NO; 55) 
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* Substrate sequences are from Cooper. J. A. (19901 Peptides and Protein Phosphorylati on 
(B.E. Kemp, ed.) CRC Press Inc. pp. 85-1 14; Geahlen, R.L. and Harrison, MX. (1990) 
Peptides and Protein Phosphorylation (B.E. Kemp, ed.) CRC Press Inc. pp. 239-254; 
5 unpublished results from Cantley and Schaffhausen labs and personal communication from T. 
Parsons; and Cheng, H.C., et al. (1991) J. Biol Chem. 266:17919-17925. 

EXAMPLE 8: Determination of Amino Acid Sequence Motifs 

for a Growth Factor Receptor 

10 

The tyrosine-containing oriented degenerate peptide library was used to determine the 
substrate specificity of a growth factor receptor tyrosine kinase, the epidermal growth factor 
receptor (EGF receptor). Bar graphs summarizing the results of the library screening are 
shown in Figure 6. The results of the library screening are summarized below in Tables 9, 
1 5 which presents the amino acid residues at degenerate positions having a relative abundance 
value equal to or greater than 1 .0: More preferred amino acid residues within the motif have 
relative abundance values equal to or greater than 1.5 and are highlighted in bold-face type: 

Table 9: Preferred amino acid residues at positions surrounding the phosphorylation site of 
20 the EGF receptor (SEQ ID NO: 13) 





=i 


=2 


zl 


a 


±1 


±2 


±1 


±A 


E (1.61) 


E (2.86) 


E (2.55) 


E(2.52) 


Y 


F(2.10) 


E (1.90) 


L(1.74) 


V(1.59) 


D(1.51) 


D (1.81) 


D(1.53) 


D (2.12) 




V(1.91) 


F(1.60) 


1(1.66) 


F(1.43) 


R(1.54) 




A (125) 


I (1.55) 




I (1.68) 


D(1.53) 


F(1.63) 


P(1.40) 


A (1.5) 




QOIO) 


N(1.09) 




E (1.50) 


V(1.40) 


V(1.50) 


E0.31) 






P(1.02) 


V(1.02) 




M(1.09) 


K1.27) 


M (120) 


L(1J0) 












L(1.04) 






I(12g) 



Like the src family tyrosine kinases examined in Example 7, the EGF receptor 
preferentially phosphorylates sequences with Glu at positions -2 and -3. The major 
difference between the optimal substrate for the cytosolic tyrosine kinases and the EGF 

25 receptor tyrosine kinase (a membrane-bound tyrosine kinase) is that the EGF receptor did not 
select strongly for peptides with lie and Val at the -1 position and it had a broad selection for 
peptides with the general motif Hydrophobic-Xaa-Hydrophobic-Hydrophobic C-terminal to 
the phosphorylation site (SEQ ID NO: 56). Interestingly, this is the general motif recognized 
by Type III SH2 domains (PhosphoTyr-Hydrophobic-Xaa-Hydrophobic; (SEQ ID NO: 57) 

30 Songyang, Z., et al (1 993) Cell 72:767-778). 

The preferred amino acid residues at positions surrounding the phosphorylation site of 
the EGF receptor represent the amino acid sequence motif for the phosphorylation site 
determined by the library screening. This amino acid sequence motif can be -compared to 
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known in vitro and in vivo peptide substrates of the EGF receptor. This analysis is shown 
below in Table 10, with amino acid residues in the known substrates which differ from the 
determined amino acid sequence motif indicated in bold type. 

Table 10: Comparison of the amino acid sequence motif determined by the peptide library to 
sequences at the same region of known substrates of the EGF receptor* 

Pentide/Protein Site 

-1 -2 -3 -4 +1 +2 +3 +4 



EGF receptor amino acid sequence motif EEEE Y FELV 
(SEQIDNO:13) DDDD VPIF 

R A I I D F P 

A Q N E V V E 

15 P V M I M L 

L I 



in vivo substrates 



20 



25 



30 



35 



EGF receptor autophos. site 


(Y1069) 


P 


V 


P 


E 


Y 


I 


N 


Q 


S 


(SEQIDNO: 58) 




(Y1086) 


Q 


N 


P 


V 


Y 


H 


N 


Q 


P 


(SEQIDNO: 59) 


it 


(Y111R) 


6 


N 


P 


E 


Y 


L 


N 


T 


V 


(SEQIDNO: 60) 


M 


(Y1148) 


D 


N 


P 


D 


Y 


Q 


Q 


D 


F 


(SEQIDNO: 61) 


ft 


(Y1173) 


E 


N 


A 


E 


Y 


L 


R 


V 


A 


(SEQIDNO: 62) 


It 


(Y992) 


D 


A 


D 


E 


Y 


L 


I 


P 


Q 


(SEQIDNO: 63) 


Lipocortin(Y21) 




E 


E 


Q 


E 


Y 


V 


Q 


T 


V 


(SEQIDNO: 64) 


Myosin light chain 




V 


D 


E 


M 


Y 


R 


E 


A 


P 


(SEQIDNO: 65) 


it 




6 


N 


F 


N 


Y 


V 


E 


F 


T 


(SEQIDNO: 66) 


Phospholipase C-y (Y771) 




A 


E 


P 


D 


Y 


6 


A 


L 


Y 


(SEQIDNO: 67) 


(Y1254) 




F 


E 


A 


R 


Y 


Q 


Q 


P 


F 


(SEQIDNO: 68) 


in vitro substrates 
























Gastrin 




E 


E 


E 


A 


Y 


G 


w 


M 


D 


(SEQIDNO: 69) 


p21 ras peptide 




6 


Q 


E 


E 


Y 


s 


A 






(SEQIDNO: 70) 



* Substrate sequences are from Geahlen, R.L. and Harrison, M.L. (1990) Peptides and . 
Protein Phosphorylation (BE. Kemp, ed.) CRC Press Inc. pp. 239-254; Varticovski, L., et al. 
(1988) Biochemistry 22:3682-3690; Wahl, M.I., etal. (1990)7. Biol. Chem. 265:3944-3948; 
and Kim, J.W., et al. (1990) J. Biol. Chem. 265:3940-3943. 
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EXAMPLE 9: Determination of Amin Acid Sequence Motifs 

for a the pp92 c - f P s/f ^ Tyrosine Kinase 

The tyrosine-containing oriented degenerate peptide library was used to determine the 
5 substrate specificity of the pp92 c ~*P s ^ es tyrosine kinase. Bar graphs summarizing the results 
of the library screening are shown in Figure 7. The results of the library screening are 
summarized below in Table 1 1 , which presents the amino acid residues at degenerate 
positions having a relative abundance value equal to or greater than 1.0: More preferred 
amino acid residues within the motif have relative abundance values equal to or greater than 
10 1 .5 and are highlighted in bold-face type: 



Table 1 1 : Preferred amino acid residues at positions surrounding the phosphorylation site of 
pp9 2 c-fps/fes (SEQIDNO:14) 



A 


=2 


=2 


zl 


4 


±1 


±2 


±2 


±4 


£(2.01) 


E (2.79) 


E (2.72) 


I (3.46) 


Y 


E (4.53) 


E (2.70) 


I (2.62) 


E(2.68) 


A (1.94) 


A (L60) 


A (1.5) 


E (1.94) 




D(1.78) 


D (2.70) 


V(2^2) 


1(1.77) 


D(1.5) 




G(l .23) 


V(1.5) 




A (125) 


1(1.29) 


E (2.06) 


V(1.63) 








D(1.14) 






V(1.14) 


M(129) 


D(1.53) 








L(1.10) 








L(U1) 


G(1.14) 



1 5 The pp92 c ~fP s 'f es tyrosine kinase selected sequences similar to those selected by the 

src tyrosine kinases with the exception that He and Val instead of Phe were preferred at the 
+3 position. The SH2 domain of pp92 c -*P s/fes , like that of is in Group I 

(Songyang, Z., et al (1993) Cell 22:767-778) and the two kinases prefer similar optimal 
phosphopeptides for binding. Thus, the preferred substrates of pp92 c *Q >s/ f es would produce 

20 products that would bind to the pp92 c " f P s/fes SH2 domain or to src-family SH2 domains. 

The preferred amino acid residues at positions surrounding the phosphorylation site of 
pp92 c "*P s/ f es represent the amino acid sequence motif for the phosphorylation site 
determined by the library screening. This amino acid sequence motif can be compared to 
25 known in vivo peptide substrates of pp92 c " f P s/fes . This analysis is shown below in Table 12, 
with amino acid residues in the known substrates which differ from the determined amino 
acid sequence motif indicated in bold type. 



WO 95/18823 



-40- 



PCT/US95/00147 



5 



10 



20 



Table 12: Comparison of the amino acid sequence motif determined by the peptide library to 
sequences at the same region of known substrates of c-fes/fps* 



Peptide/Prntein 



c-fes/fps amino acid sequence motif 
(SEQ ID NO: 14) 



Known in vivn substrates. 

Chicken lactate dehydrenase(y238) VESA Y E.VIR (SEQ ID NO: 71) 

Human Enolase(Y43) STGI Y E A L E (SEQ ID NO: 72) 

15 P140gag-fps(Y424) EDGV Y ASTG (SEQ ID NO- 73) 
P110gag-fes(Y500) 



Seauenee 


-4 


-3 


-2 


-1 
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A I E V 
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V M D 








L 
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S 
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E A L E 1 
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D 
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V 


Y 


A S T G l 


A 


D 


G 


I 


Y 


A A S G | 



* Substrate sequences are from Kemp, B.E. (ed.) (1 990) Peptides and Pmt^n 
Phosphorylation CRC Press Inc. 

EXAMPLE 10; Kinetic Evaluation of Predicted Optimal Peptide Substrates 



The peptide sequences predicted to be optimal substrates for the cyclin-dependent 
serine/threonine kinases and for pp60 v *src tyrosine kinase were synthesized to determine 

25 whether they are indeed good substrates for these kinases. The peptide Ala-Arg-Arg-Pro- 
Met-Ser-Pro-Lys-Lys-Lys-Ala (SEQ ID NO: 75) was synthesized as a substrate for cyclin 
containing kinases. The peptide Ala-Glu-Glu-Glu-IIe-Tyr-Gly-Glu-Phe-Glu-Ala-Lys-Lys- 
Lys-Lys (SEQ ID NO: 76) was synthesized as a substrate for src tyrosine kinases and was 
also tested with the EGF receptor. The kinetic studies were carried out as follows. Purified 

30 kinases were immobilized on protein A beads (ppW-src and EGF receptor) or GST beads 
(cyclin B/p33«fc2 and cyclin A/p33 c °K2). Kinase reactions were performed in 20 ul of 50 
mM Tris pH 7.0 buffer containing 50 mM NaCl, 10 mM MgCl2 (v-src and cyclin-containing 
kinases) or MnCl 2 (EGF receptor), 10 uM ATP 5 uCi [y-32p]. A TP (3000 mCi/mmol, NEN) 
and various concentrations of peptides. For the experiment measuring the competitive 

35 inhibition of v-src activity by the src motif-containing peptide, 1 uM (final concentration) 
acid treated enolase was included. After 2 and a half minutes incubation, the supematants 
were spotted on phosphocellulose paper (p81), washed four times with 75 mM phosphoric 
acid, and radioactivity counted on a scintillation counter. For phosphorylation of enolase, the 
reactions .were stopped by adding SDS loading buffer and the proteins were resolved on 10 % 

40 SDS-PAGE gels. The Km and Vmax were calculated using the Kaleidagraph computer 
software. The results are summarized below in Table 13. 



WO 95/18823 



-41- 



PCT/US95/00147 



Table 13: Kinetics of Different Kinases Phosphorylating Synthetic Peptides 

Vmax 

5 Kinase Eepiide KmiuM) Ki ( ^lv^ <>mol/mm/m g^ 

cyclinB/p33cdc2 ARRPMSPKKKA 2.4 

cyclin A/p33 CDK2 ARRPMSPKKKA 1 5 

pp60v-src AEEEIYGEFEAKKKK 32.9 8* 0.8 

10 (Y15):IEKIGEGTYGWYKGRHKTT 139.4 0.6 

EGF receptor AEEEIYGEFEAKKKK 34.0 



* As assayed by inhibition of transphosphorylation of acid denatured enolase. 

15 

The peptide containing the predicted optimal substrate motif for the cyclin-containing 
kinases was phosphorylated by cyclin B/p33 C( k2 with a Km of 2.4 jiM and by cyclin 
A/p33 CDK2 with a Km of 1 5 pM. These values indicate higher affinities than have 
previously been found with peptide substrates. The peptide containing the predicted optimal 
20 substrate motif for the src kinases was an excellent substrate for ppeOV' 5 * 0 , with a Km of 33 
\iM and Vmax of 0.8 ^mol/mg/min. In comparison, a synthetic peptide based upon the site 
of tyrosine phosphorylation on p33 c ^c2 (Y15; SEQ ID NO: 77), the best substrate previously 
reported for pp60 v - src , was phosphorylated with a KM of 139 ^M and Vmax of 0.6 
mmol/mg/min. The src-substrate peptide was also found to be an excellent competitive 
25 inhibitor of enolase phosphorylation by ppeov-^ (K 50 o /o = 8 jiM). In view of the similarity 
between optimal substrates for the various tyrosine kinases, it was not surprising that this 
peptide was also a good substrate for the EGF receptor (Km = 34 \iM). 

EXAMPLE 11 : Amino Acid Sequence Motifs for Additional 

Protein Tyrosine Kinases 

The tyrosine-containing oriented degenerate peptide library was used to determine the 
amino acid sequence motifs for the substrates of five additional protein tyrosine kinases: Ick, 
c-abl, PDGF receptor, FGF receptor and insulin receptor. The results of the library screening 
are summarized below in Tables 14-18, which presents the amino acid residues at degenerate 
positions having a relative abundance value equal to or greater than 1.0: More preferred 
amino acid residues within the motif have relative abundance values equal to or greater than 
1 .5 and are highlighted in bold-face type. "X" indicates that any amino acid is permissible at 
this amino acid position: 
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Table 14: Preferred amino acid residues at positions surrounding the phosphorylation site of 
lck (SEQ ID NO: 78) 



rft 




zA 


i 

zL 




±1 


±2 


±2 


±4 


X 


E(L5) 


X 


I (3.4) 


Y 


G (2.2) 






Y (1.7) 




G(l.l) 




V(3.0) 




A (1.6) 


A (1.4) 


V(2.2) 


M1.7) 




P(i.l) 




L(23) 




M(1.4) 


P(l-3) 


F(2.2) 


V(1.6) 




A(l.l) 




F.(1.6) 




E(1.3) 


M(1.3) 


1(2.0) 


1(1.5) 








M(1.3) 




V(l.l) 


1(1.3) 


M(1.2) 


P(l-3) 














F(l.l) 




M(1J) 














K(l.l) 




K(1J) 














L(l.l) 




0(1.1) 



5 

Table 15: Preferred amino acid residues at positions surrounding the phosphorylation site of 
c-abl(SEQIDNO:79) 





-Jl 


=2 


zl 


0 


±1 


±2 


±2 


+4 . 


A (1.7) 


X 


V(L5) 


K3.2) 


Y 


A (2 J) 


A (1.6) 


P(2J) 


F(1.5) 


R(13) 




A (1.4) 


V(1.8) 




M(1.5) 


V(1.4) 


F(1.9) 


L(1.4) 


0(1.2) 




P(l-4) 


L(1.5) 




Q(14) 


F(1.3) 


L(1J) 


103) 


P(l.l) 




1(13) 


M(1.2) 




0(1.3) 


M(1.2) 


K1.2) 


M(1.3) 






0(12) 






F (1.1) 


Q(l.2) 


V(l.l) 


0(1.1) 






E(l.l) 








L(l.l) 


A (1.1) 


P(l.l) 














1(1.1) 







10 Table 16: Preferred amino acid residues at positions surrounding the phosphorylation site of 
PDGF receptor (SEQ ID NO: 80) 



A 


=2 


=2 


=1 


fl 


±1 


±2 


±2 


±4 


E(1.6) 


E (2.0) 


E (2.0) 


E(2J) 


Y 


V(2.0) 


F(1.6) 


1(1.9) 


X 


D(l.S) 


D(1.5) 


D(1.6) 


D(1.6) 




E(1.7) 




V(1.7) 














1(1.5) 




F0.5) 





WO 95/18823 



-43- 



PCT/DS95/00147 



Table 17: Preferred amino acid residues at positions surrounding the phosphorylation site of 
FGF receptor (SEQ ID NO: 81) 





zA 




rl 


Q 


±1 


±2 


±2 


±4 






Hi \e*OJ 




Y 
I 


F f 1 Ol 


r \?.V) 


T it fi\ 


F(1.7) 


K(1.4) 


A (1.5) 


K(1.4) 


1(14) 




V(LS) 


V(1.4) 


F(1.8) 


L(1.4) 


E(l.3) 


F(1.4) 


A (1.3) 


V(1.2) 




E(1.4) 


M(1.4) 


M(1.7) 


M(1.4) 


F(l^) 


K(1.4) 


F(l.l) 


D(l.l) 




1(1.2) 


1(1.2) 


I(l«) 


V(1.4) 


POD 


M(l.l) 


POD 


F(l.l) 




M(1.2) 


L(1.2) 


V(1.6) 


E(1.3) 


G(l.l) 


0(1.1) 


0(1.1) 


A (1.1) 




A (1.2) 


E(l.l) 


E(1.2) 


K13) 




1(11) 


V(l.l) 


L(l.l) 




K(12) 


P(l.l) 




P(1.3) 




V(l.l) 










K(l.l) 







5 

Table 18: Preferred amino acid residues at positions surrounding the phosphorylation site of 
insulin receptor (SEQ ID NO: 82) 



=4 


=a 


=2 


=1 


fi 


±1 


±2 


±2 


±4 


X 


E(1.7) 


E(1.8) 


E(1.6) 


Y 


M (2.8) 


M(2J2) 


M(2.2) 


M(1.9) 




0(1.3) 


A (1.4) 


N(1.5) 




F(1.6) 


F(1.6) 


F (1.9) 


F(1.9) 




A (1.3 


0(13) 


D(1.5) 




A (1.4) 


V(U) 


1(1.5) 


E(IS) 




F(1.2) 


P(1.3) 


A (12) 




E(1.3) 


E(1.3) 


E(1.5) 


V(1.4) 




N(l.l) 


N(l.l) 


K1.2) 




V(l.l) 


A (12) 


V(1.4) 


L(1.3) 




M(l.l) 




POD 






D(l.l) 




K1.2) 








0(1.1) 






L(l.l) 




0(1.1) 














1(11) 







EXAMPLE 11 : Amino Acid Sequence Motifs for Additional 

1 0 Protein Serine/Threonine Kinases 

The serine-containing oriented degenerate peptide library was used to determine the 
amino acid sequence motifs for the substrates of five additional protein serine/threonine 
kinases: casein kinase II, NIMA, phosphorylase kinase, Cam kinase II and Erkl . The results 
15 of the library screening are summarized below in Tables 19-23, which presents the amino 
acid residues at degenerate positions having a relative abundance value equal to or greater 
than 1.0: More preferred amino acid residues within the motif have relative abundance 
values equal to or greater than 1.5 and are highlighted in bold-face type. 
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Table 19: Preferred amino acid residues at positions surrounding the phosphorylation site of 



casein kinase II (SEQ ID NO: 83) 



=4 


=2 


=2 


=1 


Q 


±1 


±2 


±2 


±4 


E (2.2) 


D(3.2) 


E (2.8) 


E(2.7) 


S 


E (3.0) 


D(3.4) 


E(3.4) 


E(4.1) 


D (1.8) 


E(2.2) 


D(2.6) 


D (2.0) 


T 


D (2.9) 


E(23) 


D (3.0) 


D(2.1) 


A (1.8) 


G(1.4) 


A(l.l) 


0(1.2) 




A (1.5) 


A (1.6) 


A (1.4) 


1(13) 


G(1.2) 


A (1.3) 


0(1.1) 


A (1.2) 


















M(1.2) 













5 

Table 20: Preferred amino acid residues at positions surrounding the phosphorylation site of 
N1MA (SEQ ID NO: 84) 



=4 


=2 


=2 


zl 


Q 


±1 


±2 


±2 


±4 


11(2.0) 


F(73) 


R(4.5) 


R (4.6) 


S 


R(2.6) 


R (2.0) 


M (1.9) 


1(2.1) 


N(1.5) 


L(23) 


K(1.2) 


K(1.6) 


T 


1(1.7) 


1(2.0) 


1(1.8) 


F(1.7) 


F(1.4) 


M(1.7) 


N(l 2) 


M (1.4) 




V(1.7) 


M (1.9) 


F(1.7) 


M(1.7) 


M(1.2) 






Q(ll) 




M(1.7) 


V(1.6) 


V(1.5) 


0(1.3) 


Q(l.l) 










L(1.4) 


F(1.5) 


L(1.2) 


L(l^) 


G(l.l) 










Q(1.2) 


L(l-2) 


0(1.1) 


V(1.2) 



1 0 Table 2 1 : Preferred amino acid residues at positions surrounding the phosphorylation site of 



Phosphorylase Kinase (SEQ ID NO: 85) 



=4 


=2 


=2 


=1 


a 


±1 


±2 


±2 


±1 


F(1.7) 


R(5.7) 


M(2.0) 


M(2.2) 


S 


F(3.9) 


F(2.5) 


L(2.1) 


F(23) 


M(1.5) 


K(2.5) 


R(1.8) 


F(2.0) 


T 


1(2.4) 


R(2.0) 


1(2.0) 


L(1.8) 


K(1.4) 




Q(1.7) 


L(1.8) 




M(1.7) 


K(2.0) 


M(1.2) 


1(1.7) 


1(14) 




F(1.6) 


1(1.5) 




L(1.6) 


M(1.4) 


P(l-2) 


V(1.4) 


H(1.4) 




K(M) 


K(1.2) 




V(L2) 


1(1.2) 


V(l.l) 


P(U) 


V(l.l) 




L(l.l) 


V(l.l) 










M(1.2) 



WO 95/18823 



-45- 



PCT/US95/00147 



Table 22: Preferred amino acid residues at positions surrounding the phosphorylation site of 
Cam Kinase II (SEQ ID NO: 86) 



-4 


-3 


-7 


.1 


o 

it 


+1 


+2 






K (2.3) 


R(3.9) 


Q(4.5) 


Q(2.1) 


s 


F(4.8) 


D(2.5) 


L(1.7) 


F(1.7) 


F<L5) 


K(2.7) 


M (1.8) 


M(2.0) 


T 


1(19) 


E(1.7) 


M(1.7) 


K(M) 


D(1.2) 


F(1.3) 


K(l.l) 


K(1.9) 




M(1.7) 


1(1.5) 


K(1.(S) 


M(13) 


H(12) 


M(1.2) 


A (1.1) 


L(1.8) 




L(1.6) 


M(1J) 


1(13) 


V(1.3) 


M(l.l) 




F(l.l) 


F(1.5) 




V(1.3) 


K(1.3) 


V(1.3) 


L(1.3) 








A(l.l) 






V(l^) 


Q03) 


1(1.2) 














L(l.l) 


G(l.l) 


P(l.l) 



Table 23: Preferred amino acid residues at positions surrounding the phosphorylation site of 
Erkl (SEQ ID NO: 87) 



A 


=2 


=2 


=1 




±1 


±2 


±2 


±4 


T (2.0) 


G(3.2) 


P(6.5) 


L(2.0) 


s 


P 


G (2.1) 


P(2-4) 


F(2J) 


P(13) 


P(1.5) 


L(2.1) 


M (1.9) 


T 




P(1.9) 


F(1.9) 


Y(1.7) 


S(1.4) 


E(1.5) 


1(13) 


P(1.7) 






F(1.3) 


G(1.5) 


1(1.2) 


QO-3) 


Y<1J> 


V(U) 


K(1.4) 






T(U) 


Y(1.5) 


L(1.2) 


1(1.2) 


QO-2) 




QO-2) 






L(1.2) 


L(l.l) 




V(1.2) 






TOD 








Kl.l) 




N(1.2) 


















E(1.2) 



















10 EXAMPLE 13! Comparison of Optimal Substrates of abl and src Kinases 

Optima] substrate peptides for (he src and abl protein tyrosine kinases were 
synthesized based on the determined amino acid sequence motifs for each kinase. Hie 
synthetic src and able optimal peptides were compared for their ability to act as specific, high 

15 affinity substrates for src or abl. The synthetic src optimal peptide had the amino acid 

sequence AEEEIYGEFEAKKKK (SEQ ID NO: 76). The synthetic abl optimal peptide had 
the amino acid sequence EAIYAAPFAKKK (SEQ ID NO: 88). The two peptides were 
phosphorylated by GST-fusion constructs of src and abl. Different concentrations of peptides 
were phosphorylated by the kinases in reactions containing 10 mM MnCl2, 50 mM Tris pH 

20 7.5, 2 mM DTT, 10 ^iM ATP and 5 *iCi [y-^Pl-ATP for 3 minutes at 25 ° C. The amount of 
radioactivity incorporated into the peptides was determined using a phosphocellulose assay. 
The results of the phosphorylation experiments are illustrated graphically in Figure 8, panels 
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A and B. The synthetic src optimal peptide was found to be an excellent substrate for src 
(K m = 33 nM, V max = 0.8 ^mol/mg/min)(see Figure 8B), but a poor substrate for abl. 
Similarly, the synthetic abl optimal peptide was favorably phosphorylated by abl (Km = 4 
HM, Vmax = 0.5 ^mol/mg/min)(see Figure 8A), but poorly phosphorylated by src. 

5 

EXAMPLE 14 : Preferential Phosphorylation of Sequences that 

Bind to Specific SH2 Domains 

A comparison of the optimal peptide substrates of protein tyrosine kinases (PTKs) to 

10 the phosphopeptide motifs selected by individual Src homology region 2 (SIC) domains (see 
Songyang, Z. et al. (1993) Cell 22:767-778; Songyang, Z. et al. (1994) Mol Cell. Biol 
14:2777-2785) revealed that individual protein tyrosine kinases preferentially phosphorylate 
sequences that bind to specific SH2 domains. The receptor PTKs phosphorylated peptides 
with the general motif Tyr-hydrophobic-X-hydrophobic, characteristic of peptides recognized 

15 by Group III SH2 domains (e.g., p85, PLC-y, SHPTP2). In contrast, the SH2-containing 
PTKs preferentially phosphorylated peptides with the motif Tyr-hydrophilic-hydrophilic- 
hydrophobic, recognized by Group I SH2 domains. Even more interesting is the finding that 
some of the SH2-containing PTKs selectively phosphorylate sites recognized by their own 
SIC domains. For example, the optimal substrate for the fps/fes PTK contains the motif 

20 YEEI/V and the optimal substrate for the fps/fes SH2 domain is pYEEV/L This selection is 
not due to the ability of the SH2 domain to protect the product from phosphatases or to 
present the substrate to the catalytic domain since similar motifs were found in experiments 
using kinase constructs lacking SH2 domains. 

To investigate further whether cytosolic PTKs preferentially phosphorylate sites 

25 recognized by their own or related SH2 domains, the ability of the abl or Ick SH2 domain to 
bind peptides phosphorylated by either abl or lck was examined. In these experiments, a 
tyrosine-containing degenerate peptide libraiy as described in Example 1 was phosphorylated 
by purified lck or GST-c-abl with [y- 32 P]-ATP such that a small fraction (0.001%) of the 
peptides were phosphorylated. Synthetic degenerate phosphopeptides (30 ^M; as described 

30 in Songyang, Z. et al. (1993) Cell 22:767-778) were also added to the reaction buffer to 
prevent binding of the labeled peptides to endogenous SH2 domains in the kinases. The 
labeled phosphopeptide mixture was added to immobilized GST-SH2 domains (300 nM) with 
1 of synthetic phosphopeptides. To correct for background binding, control experiments 
were performed in the same solution with 20 mM phenyl phosphate. After 2 hours of 

35 incubation, the pellets were washed quickly with 1 ml ice-cold PBS and the radioactivity 

associated with the pellets was measured. The results are shown graphically in Figure 9. The 
plot shows the percentages of the total phosphorylated peptides that bound to SH2 domains 
of c-abl and lck. Error bars indicate the standard error (n=3). The data indicate that the abl 
SH2 domain bound a large fraction of peptides phosphorylated by the abl PTK but a smaller 
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fraction of peptides phosphorylated by the lck PTK. In contrast, the lck SH2 domain 
preferentially bound peptides phosphorylated by the lck PTK. 

EXAMPLE IS: Mutation of the RET Protein Tyrosine Kinase 

5 Affects Substrate Specificity 



The following studies with the RET receptor protein tyrosine kinase provide evidence 
that catalytic site specificity of PTKs is critical for fidelity in downstream signaling. A 
germline mutation affecting the catalytic core of the RET receptor PTK results in an 

10 autosomal dominantly-inherited cancer syndrome referred to as multiple endocrine neoplasia 
type 2B (MEN 2B) (see Hofstra, R.M. et al. (1994) Nature 26Z:375-376; Eng, C. et aL (1994) 
Hum. Mol Genet 2:237-241; and Carlson, K.M. et al. (1994) Proc. Natl Acad Sci. USA 
21:1579-1583). The mutated residue is in a region of domain VIII of the enzyme that has 
been implicated in peptide substrate selectivity of protein Ser/Thr kinases (see Songyang, Z. 

15 et al. (1994) Current 4:973-982; Hanks, S.K. et al. (1988) Science 241:42-52; and 

Knighton, D.R. et al. (1991) Science 252:414-420). The wild-type RET protein has Met at 
amino acid position 91 8, as found in most other receptor PTKs. In the mutant RET protein 
found in MEN 2B, this residue is mutated to Thr, making this region more similar to SH2- 
containing PTKs. 

20 The ability of baculovirus-expressed cytoplasmic domains of either wild-type (WT) or 

MEN 2B mutant (2B) RET to phosphorylate three different peptide substrates was examined. 
The RET proteins were immunoprecipitated from insect cells using phosphotyrosine-specific 
antibodies and used to phosphorylate either an EGF receptor optimal substrate peptide, an 
able optimal substrate peptide or a src optimal substrate peptide (each at 10 nM) in the 

25 presence of [y-32P]-ATP. The amount of radioactivity incorporated was determined using a 
phosphocellulose assay. The incorporated activity was normalized such that RET WT and 
2B had the same autophosphorylation activity. The results are shown graphically in Figure 
10. The error bars indicate the difference between two experiments. The data indicate that 
the wild-type RET protein preferentially phosphorylates the optimal peptide substrate for the 

30 EGF receptor while the mutant RET protein preferentially phosphorylates the optimal peptide 
substrates for abl and src. These data suggest that phosphorylation of inappropriate substrates 
by the MEN 2B RET PTK results in the stimulation of pathways normally mediated by SH2- 
containing PTKs, probably involving signaling proteins with Group I SH2 domains. 

35 EQUIVALPNTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following claims. 



WO 95/18823 



-48- 



PCTYUS95/00147 



SEQUENCE LISTING 



10 



15 



20 



25 



30 



35 



40 



45 



50 



5.5 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: BETH ISRAEL HOSPITAL 

(B) STREET: 330 BROO KLINE AVENUE 
<C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 02215 

(G) TELEPHONE: (6X7) 735-4585 

(ii) TITLE OF INVENTION: Substrate Specificity of Protein Kinases 
(iii) NUMBER OF SEQUENCES: 88 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD 

(B) STREET: 60 STATE STREET, suite 510 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: USA 

(F) ZIP: 02109-1875 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: ASCII text 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/178,570 

(B) FILING DATE: JANUARY 7, 1994 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: DeConti, Giulio A., Jr. 

(B) REGISTRATION NUMBER: 31,503 

(C) REFERENCE /DOCKET NUMBER: BBI-004CPPC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 227-7400 

(B) TELEFAX: (617) 227-5941 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(v) FRAGMENT TYPE: internal 
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(ixj FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1-9 

(D) OTHER INFORMATION: /note* "Xaa, if present, is any amino 

5 acid" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION : 10 

10 (D) OTHER INFORMATION: /note= "Xaa is any amino acid" 



15 



20 



25 



30 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 11 

(C) OTHER INFORMATION: /note= "Xaa is Ser, Thr or Tyr" 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 12 

<D) OTHER INFORMATION: /note= "Xaa is any amino acid" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 13-21 

(D) OTHER INFORMATION: /note= "Xaa, if present, is any amino 

acid" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 15 



35 



Xaa Xaa Xaa Xaa Xaa 
20 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

45 (v) FRAGMENT TYPE: internal 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1-24 

(D) OTHER INFORMATION: /note* "Xaa, if present, is any amino 

acid" 



(ix) FEATURE: 

(A) NAME /KEY: Modified- site 
55 (B) LOCATION: 25 

(D) OTHER INFORMATION: /note- "Xaa is any amino acid" 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 
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(B) LOCATION: 26 

(D) OTHER INFORMATION: /note= w Xaa is Ser, Thr or Tyr" 

(ix) FEATURE: 
5 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 27 

(D) OTHER INFORMATION: /note- "Xaa is any amino acid" 

(ix) FEATURE: 
10 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 28-51 

(D) OTHER INFORMATION: /note= "Xaa, if present, is any amino 

acid" 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 

20 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 



25 



30 



35 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

40 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3-6 

(D) OTHER INFORMATION: /note- "Xaa is any amino acid" 

45 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note- "Xaa is Ser, Thr or Tyr" 

50 (ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 8-11 

(D) OTHER INFORMATION: /note= "Xaa is any amino acid* 
55 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 

Met Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ala Lys Lys Lys 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

10 (v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
<B) LOCATION: 1-4 

15 (D) OTHER INFORMATION: /note= "Xaa is any amino acid" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

20 (D) OTHER INFORMATION: /note= "Xaa is Ser, Thr or Tyr" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 6-9 

25 (D) OTHER INFORMATION: /note= "Xaa is any amino acid" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
30 .1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 
45 (B) LOCATION: 1-18 

(D) OTHER INFORMATION: /note* "Xaa, if present, is any amino 

acid" 

(ix) FEATURE: 
50 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /note= "Xaa is any amino acid" 

(ix) FEATURE: 
55 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 20 

(D) OTHER INFORMATION: /note= "Xaa is Ser, Thr or Tyr" 



(ix) FEATURE: 
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10 



15 



30 



(A) NAME/KEY: Modified- site 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /note* "Xaa is any amino acid" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 22-39 

(D) OTHER INFORMATION: /note= "Xaa, if present, is any amino 

acid" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 35 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note- Xaa is Arg or His 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOACTION: 2 

<D> OTHER INFORMATION: /note= Xaa is Arg or His 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note« Xaa is Arg, Lys or His 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note* Xaa is Arg or His 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= Xaa is Ser or Thr 



(ix) FEATURE: 
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(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note* Xaa is He, Met, Phe, Leu or Val 

5 (ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= Xaa is He, Val or Phe 

10 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /note= Xaa is Phe, He, Met, Val or Leu 

IS (ix) FEATURE: 

(A) NAME/ KEY: Modified- site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= Xaa is He, Leu, Phe, Met or Val 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



35 



40 



45 He 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= Xaa is Arg, His, Asn or Gin 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= Xaa is Arg, His, Pro, Met, Val or 



( ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 3 

50 (D) OTHER INFORMATION: /note* Xaa is Pro, Met or Leu 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 4 

55 (D) OTHER INFORMATION: /note= Xaa is Met, Gin, Ala, Asn, Arg or 

Lys 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
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(B) LOCATION: 5 

(D) OTHER INFORMATION: /note=* Xaa is Ser or Thr 

(ix) FEATURE: 
5 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= Xaa is Pro, Lys, Met, Ala, Val or 

Arg 

10 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /note= Xaa is Lys, Arg or Pro 

15 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= Xaa is Lys, lie, Arg or Asn 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE : amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



35 



40 



45 



50 



55 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "Xaa is Arg or His" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note- "Xaa is Arg, Pro, Met, Val or lie" 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "Xaa is Pro or Met" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(C) OTHER INFORMATION: /note= "Xaa is Met, Gin, Ala, Asn, Arg or 

Lys" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 
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(D) OTHER INFORMATION: /note= "Xaa is Ser or Thr" 

(ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 
5 (B) LOCATION: 7 

(D) OTHER INFORMATION : /note= "Xaa is Pro, Lys, Met, Ala or Val 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
10 (B) LOCATION: 8 

(D) OTHER INFORMATION: /note= "Xaa is Lys, Arg or Pro" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
15 (B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "Xaa is Lys, lie or Arg" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

20 Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa 

1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 
35 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "Xaa is Arg, His, Asn or Gin" 

(ix) FEATURE: 
40 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note- "Xaa is Arg, His, Val, Pro or Met' 

(ix) FEATURE: 
45 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note- "Xaa is Pro, Met or Leu" 

(ix) FEATURE: 
50 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "Xaa is Met, Gin, Arg, Asn or Ala' 

(ix) FEATURE: 
55 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

<D) OTHER INFORMATION: /note= "Xaa is Ser or Thr" 



(ix) FEATURE: 
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(A) NAME/KEY: Modified- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "Xaa is Pro, Met, Arg, Val or Lys" 

5 (ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION : 8 

(D) OTHER INFORMATION: /note* M Xaa is Lys or Arg" 

10 (ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note* "Xaa is Lys or Asn" 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa 
1 5 

20 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



30 



35 



40 



45 



50 



55 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note* "Xaa is Glu or Asp" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "Xaa is Glu or Asp" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "Xaa is Glu, Asp, Gly or Asn" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note* "Xaa is lie, Val, Leu or Pro" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "Xaa is Gly, Glu or Asp" 

(ix) FEATURE: 

<A) NAME/KEY: Modif ied-site 
(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "Xaa is Glu or Asp" 
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(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

5 (D) OTHER INFORMATION: /note* "Xaa is Phe, He, Leu, Val or Met" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

10 (D) OTHER INFORMATION: /note= "Xaa is Glu, Phe or Asp" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa 
15 1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
30 (B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "Xaa is Glu or Asp" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
35 (B) LOCATION: 2 

(C) OTHER INFORMATION: /note= "Xaa is Glu or Asp" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
40 (B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "Xaa is Glu, Asp or Gly" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
45 (B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "Xaa is He, Val, Leu or Pro" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
50 (B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "Xaa is Gly or Glu" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
55 (B) LOCATION: 8 

<D) OTHER INFORMATION : /note= "Xaa is Phe, He, Leu or Met" 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 
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(B) LOCATION: 9 

(D) OTHER INFORMATION : /note= "Xaa is Phe or Glu" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

5 

Xaa Xaa Xaa Xaa Tyr Xaa Glu Xaa Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

20 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

(C) OTHER INFORMATION: /note= "Xaa is Glu or Asp" 

25 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note- "Xaa is Glu or Asp" 

30 (ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "Xaa is Glu, Asp, Gly or Asn H 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /notes "Xaa is lie, Val, Leu or Pro" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note* "Xaa is Gly, Glu or Asp" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note- "Xaa is Glu or Asp" 

(ix) FEATURE : 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /note* "Xaa is Phe, He, Leu, Val or Met" 

(ix) FEATURE: 

<A) NAME/ KEY: Modif ied-site 
(B) LOCATION: 9 

(D) OTHER INFORMATION: /note- "Xaa is Glu or Asp" 



WO 95/18823 



-59- 



PCT/US95/00147 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa 
1 5 

5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 (v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

20 <D) OTHER INFORMATION: /note= "Xaa is Glu, Asp, Arg or Ala" 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 2 

25 (D) OTHER INFORMATION: /note* "Xaa is Glu or Asp" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

30 (D) OTHER INFORMATION: /note= "Xaa is Glu, Asp, Ala, Gin or Pro" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

35 (D) OTHER INFORMATION: /note= "Xaa is Glu, Asp, lie, Asn or Val" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 6 

40 (D) OTHER INFORMATION: /note* "Xaa is Phe, Val, He, <31u, Met or 

Leu" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 
45 (B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "Xaa is Glu, Phe, Asp, Val or He" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
50 (B) LOCATION: 8 

(C) OTHER INFORMATION: /note= "Xaa is Leu, He, Phe, Val or Met" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
55 * * (B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "Xaa is Val, Phe, Pro, Glu, Leu or 

lie" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "Xaa is Glu, Ala or Asp" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note* "Xaa is Glu or Ala" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "Xaa is Glu, Ala or Gly" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "Xaa is lie, Glu, Val, Asp or Leu" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "Xaa is Glu, Asp or Ala" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note« "Xaa is Glu, Asp, He or Val" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
(Bl LOCATION: 8 

(D) OTHER INFORMATION: /note* "Xaa is He, Val, Glu, Met or Leu" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "Xaa is Glu, He, Val, Asp or Gly" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa 
1 5 
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10 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

15 Gly Arg Arg Asn Ala lie His Asp lie 

1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

<v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 



30 



35 



40 



Leu Arg Arg Ala Ser Leu Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17* 

Gly Arg Thr Gly Arg Arg Asn Ser He 
1 5 

50 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: peptide 

iv) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SBQ ID NO:18 

Phe Arg Arg Leu Ser lie Ser Thr Glu 
5 1 5 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

20 Lys Arg Ser Gly Ser Val Tyr Glu Pro 

1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 



35 



40 



45 



Pro Arg Arg Ala Ser Cys Thr Ser Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

Lys Arg Ser Asn Ser Val Asp Thr Ser 
1 5 

55 (2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

5 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Ser Arg Thr Leu Ser Val Ser Ser Leu 
10 1 5 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
IS (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

25 Leu Arg Arg Ala Ser Leu Ala Gin Leu 

1 5 

(2) INFORMATION FOR SEQ ID NO: 24: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Arg Arg Arg Gly Ser Ser lie Pro Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Arg Arg Arg Pro Thr Pro Ala Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

10 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

Gin Arg Arg Arg Ser Leu Glu Pro Ala 
15 1 5 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

30 Val Ala Ala Lys Ser Pro Ala Lys Ala 

1 5 

(2) INFORMATION FOR SEQ ID NO: 28: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 



45 



50 



55 



Lys Ala Ala Lys Ser Pro Lys Lys Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

Gly Thr Pro Leu Ser Pro Thr Arg lie 
1 5 

5 

(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 

Asp Thr His Arg Thr Pro Ser Arg Ser 
20 i 5 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

35 ser Gin His Ser Thr Pro Pro Lys Lys 

1 5 

(2) INFORMATION FOR SEQ ID NO: 32: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



50 



55 



Tyr Thr Thr Asn Ser Pro Ser Lys lie 
1 5 

(2) INFORMATION FOR SEQ ID NO:33: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



10 



35 



55 



Val lie Lys Arg Ser Pro Arg Lys Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

20 (v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Gly Arg Pro lie Thr Pro Pro Arg Lys 
25 l 5 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

40 Glu Phe Pro Leu Ser Pro Pro Lys Lys 

1 5 

(2) INFORMATION FOR SEQ ID NO: 36: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 



Arg Arg Pro Val Ser Pro Gly Lys Asp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 37: 
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(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



5 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Arg Arg Glu Met Ser Pro Pro Phe Lys 



(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide . 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Asp Lys Pro Leu Ser Pro Pro Gin Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Arg Lys Ala Arg Ser Pro Ala Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



1 



5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
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Arg Arg Pro Leu Ser Pro Glu Ala Ser 
1 5 

5 (2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 



15 



20 



Asp Tyr Pro Leu Ser Pro Pro Lys Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 42: 



(i) SEQUENCE CHARACTERISTICS : . 
(A) LENGTH: 4 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

30 (v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

35 (D) OTHER INFORMATION: /note- "Xaa is phospho-Tyr" 

(ix) FEATURE: 

(A) NAME/ KEY: Modif ied- site 

(B) LOCATION: 2 

40 (D) OTHER INFORMATION: /note- "Xaa is Gly or Glu" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 3 

45 (D) OTHER INFORMATION: /note= w Xaa is Phe or lie" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Xaa Xaa Glu Xaa 
50 i 

(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note- "Xaa is phospho-Tyr" 



10 



15 



20 



40 



45 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 43: 

Xaa Glu Glu He 
1 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Glu Glu Ala Tyr Gly Trp Met Asp 
1 5 

30 (2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



Thr Gly He Tyr Glu Ala Leu Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
5Q (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

55 * *(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

^51u Gly Thr Tyr <31y Val Val tyr 
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1 5 
(2) INFORMATION FOR SEQ ID NO: 47: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 



15 



20 



25 



45 



50 



. Glu Glu Glu Tyr Met Pro Met Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48 

Glu Asn Glu Tyr Met Pro Met Ala 
1 5 

35 (2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 



Glu Pro Gin Tyr Glu Glu lie Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
55 (b) TYPE: amino acid 

(D) TOPOLOGY: linear 



Ui) MOLECULE TYPE: peptide 
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(v) FRAGMENT TYPE: internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 

5 Glu Asp Gin Tyr Ser Leu Val Glu 

1 5 

(2) INFORMATION FOR SEQ ID NO: 51: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



20 



25 



30 



50 



55 



Asp Asn <31u Tyr Thr Ala Arg Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Leu Tyr Asp Tyr Glu Ser Trp lie 
1 5 

40 (2) INFORMATION FOR SEQ ID NO: 53: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



Thr Asp Asp Tyr Ala Glu lie Asp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 8 amino acids 
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(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

5 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

10 Glu Gly Leu Tyr Asn Glu Leu Gin 

1 5 

(2) INFORMATION FOR SEQ ID NO: 55: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

25 

Ala Glu Ala Tyr Ser Glu lie Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

40 (ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "Xaa is any hydrophobic amino acid" 

45 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note* "Xaa is any amino acid" 

50 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note* "Xaa is any hydrophobic amino acid" 

55 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "Xaa is any hydrophobic amino acid" 



30 



35 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Xaa Xaa Xaa Xaa 
1 

5 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 (v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

20 (D) OTHER INFORMATION: /note= "Xaa is phospho-Tyr" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

25 (D) OTHER INFORMATION: /note= "Xaa is any hydrophobic amino acid 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

30 (D) OTHER INFORMATION: /note= "Xaa is any amino acid" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "Xaa is any hydrophobic amino acid 



35 



40 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Xaa Xaa Xaa Xaa 
1 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Pro Val Pro Glu Tyr lie Asn Gin Ser 
1 5 



(2) INFORMATION POR SEQ ID NO:59: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 ■ 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

10 (xi)' SEQUENCE DESCRIPTION: SEQ ID NO: 59 

Gin Asn Pro Val Tyr His Asn Gin Pro 
1 5 

15 (2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 



25 



30 



55 



Gly Asn Pro Glu Tyr Leu Asn Thr Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 61: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

40 (v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Asp Asn Pro Asp Tyr Gin Gin Asp Phe 
45 i 5 

(2) INFORMATION FOR SEQ ID NO: 62: 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
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Glu Asn Ala Glu Tyr Leu Arg Val Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 63: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 

Asp Ala Asp Glu Tyr Leu lie Pro Gin 
1 5 

20 (2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
25 <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 



30 



35 



Glu Glu Gin <31u Tyr Val Gin Thr Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 65: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 
40 (b) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

45 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

Val Asp Glu Met Tyr Arg Glu Ala Pro 
50 i s 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS : 
55 (a) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



10 



15 
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. (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Gly Asn Phe Asn Tyr Val Glu Phe Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Ala Glu Pro Asp Tyr Gly Ala Leu Tyr 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 



35 



40 



Phe Glu Ala Arg Tyr Gin Gin Pro Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO:69: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

50 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Glu Glu <31u Ala Tyr Gly Trp Met Asp 
5.5 1 5 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 



10 



15 



20 
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(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 



40 



45 



Gly Gin Glu Glu Tyr Ser Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Val Glu Ser Ala Tyr Glu Val He Arg 
1 5 

30 (2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 



Ser Thr Gly He Tyr Glu Ala Leu Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 73: 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 9 amino acids 
50 (b) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

55 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
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10 



35 



Glu Asp Gly Val Tyr Ala Ser Thr Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Ala Asp Gly lie Tyr Ala Ala Ser Gly 
1 5 

20 (2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 



30 



(ii) 


MOLECULE TYPE: peptide 




(v) 


FRAGMENT TYPE: internal 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO; 


:75: 


Ala 


Arg Arg Pro Met Ser Pro Lys Lys 


Lys Ala 


1 


5 


10 



(2) INFORMATION FOR SEQ ID NO: 76: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
45 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Ala Glu Glu Glu lie Tyr Gly Glu Phe Glu Ala Lys Lys Lys Lys 
50 1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 
55 * ' (A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

He Glu Lys He Gly Glu Gly Thr Tyr Gly Val Val Tyr Lys Gly Arg 
15 10 15 

His Lys Thr Thr 
10 20 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 9 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
25 (B) LOCATION: 1 

(D) OTHER INFORMATION: /note* "Xaa is any amino acid" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
30 (B) LOCATION: 2 

(D) OTHER INFORMATION: /note* "Xaa is Glu, Gly, Pro or Ala" 

(ix) FEATURE: 

(A) NAME/ KEY: Modif ied-site 
35 (B) LOCATION: 3 

(D) OTHER INFORMATION: /note* "Xaa is is any amino acid" 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 
40 (B) LOCATION: 4 

(D) OTHER INFORMATION: /note* "Xaa He, Val, Leu, Phe or Met" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
45 (B) LOCATION: 6 

(D) OTHER INFORMATION: /note- "Xaa is Gly, Ala, Met, Glu or Val 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 
50 (B) LOCATION: 7 

(D) OTHER INFORMATION : /note* "Xaa is Val, Ala, Pro, Met, He, 

Phe, Lys or Leu" 

(ix) FEATURE: 
55 (A) NAME/KEY: Modif ied-site 

(B) LOCATION : 8 

(D) OTHER INFORMATION: /note* "Xaa is Leu, Val, Phe, He or Met 
(ix) FEATURE: 
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10 



25 



30 



35 



(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

. (D) OTHER INFORMATION: /note= "Xaa is Phe, Leu, Val, lie, Pro, 

Met, Lys or Gly M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 79: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

20 (v) FRAGMENT TYPE: internal 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "Xaa is Ala, Arg, Gly or Pro" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note* "Xaa is any amino acid" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "Xaa is Val, Ala, Pro, He, Gly or 

Glu" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
40 (B) LOCATION: 4 

(D) OTHER INFORMATION: /note* "Xaa is He, Val, Leu or Met" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
45 (B) LOCATION: 6 

(D) OTHER INFORMATION: /note* "Xaa is Ala, Met, Gin, Gly or Phe" 

(ix) -FEATURE: _ - - - .- - - - - 

(A) NAME /KEY: Modif ied-site 
50 (B) LOCATION: 7 

(D) OTHER INFORMATION: /note* "Xaa is Ala, Val, Phe, Met, Gin, 

Leu or He" 

(ix) FEATURE: 
55 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note* "Xaa is Pro, Phe, Leu, He, Val or 

Ala" 
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(ix) FEATURE: 

(A) NAME /KEY: Modified- site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= n Xaa is Phe, Leu, He, Met, Gly or 
5 Pro" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa 
10 1 5 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE : peptide 
(v) FRAGMENT TYPE: internal 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 
25 (B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "Xaa is Glu or Asp" 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 
30 (B) LOCATION: 2 

(D) OTHER INFORMATION: /note- "Xaa is Glu or Asp" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
35 (B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "Xaa is Glu or Asp" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
40 (B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "Xaa is Glu or Asp" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
45 (B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "Xaa is Val, Glu or He" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
50 (B) LOCATION: 8 

(D) OTHER INFORMATION: /note= "Xaa is He, Val or Phe" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 
55 (B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "Xaa is any amino acid" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
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Xaa Xaa Xaa Xaa Tyr Xaa Phe Xaa Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

15 (ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note* "Xaa is Ala, Lys, Glu, Phe, Pro or 

Gly" 



10 



20 



25 



30 



50 



55 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note* "Xaa is Glu, Ala, Phe, Lys, Met, 

Gly, He or Val n 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /notes "Xaa is Glu, Lys, Ala, Phe, Pro, 

Gly or Val" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
35 (B) LOCATION: 4 

(D) OTHER INFORMATION: /note- "Xaa is Glu, He, Val, Asp, Phe, 

Ala or Leu" 

(ix) FEATURE: 
40 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note- "Xaa is Phe, Val, Glu, He, Met, 

Ala or Lys" 

45 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "Xaa is Phe, Val, Met, lie, Leu, 

Glu, Pro or Lys" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note- "Xaa is Leu, Phe, Met, He, Val or 

Glu" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 
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(D) OTHER INFORMATION: /noteo "Xaa is Phe, Leu, Met, Val, Glu, 

lie or Pro 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

5 

Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note- "Xaa is any amino acid" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note- "Xaa is Glu, Gly, Ala, Phe, Asn or 

Met" 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note* "Xaa is Glu, Ala, Gly, Pro or Asn" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note- "Xaa is Glu, Asn, Asp, Ala, He, 

Pro or Gly" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note* "Xaa is Met, Phe, Ala, Glu or Val" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "Xaa is Met, Phe, Val, Glu, Ala, 

Asp, Leu or He" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /note= "Xaa is Met ,Phe, He, Glu or Val" 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 
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(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "Xaa is Met, Phe, Glu, Val, Leu, 

lie or Gly" 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa 
1 5 

10 (2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



20 



25 



30 



35 



40 



45 



50 



55 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-eite 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= Xaa is Glu, Asp, Ala or Gly 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOACTION: 2 

(D) OTHER INFORMATION: /note* Xaa is Asp, Glu, Gly or Ala 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= Xaa is Glu, Asp, Ala or Gly 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= Xaa is Glu, Asp, Gly, Ala or Met 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note* Xaa is Ser or Thr 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= Xaa is Glu, Asp or Ala 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note- Xaa is Asp, Glu or Ala 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 
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(D) OTHER INFORMATION: /note= Xaa is Glu, Asp or Ala 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
5 (B) LOCATION: 9 

(D) OTHER INFORMATION: /note= Xaa is Glu, Asp or lie 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 

10 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

1 5 

(2) INFORMATION FOR SEQ ID NO: 84: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: peptide 



(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 
25 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= Xaa is Arg, Asn, Phe, Met, Gin or 

Gly 

30 (ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 
<B) LOACTION: 2 

(D) OTHER INFORMATION: /note= Xaa is Phe, Leu or Met 

35 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note* Xaa is Arg, Lys or Asn 



FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /notes Xaa is Arg, Lys, Met or <31n 

FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /notes Xaa is Ser or Thr 

FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= Xaa is Arg, lie, Val, Met, Leu or 

Gin 

(ix) FEATURE: 

<A) NAME /KEY : Modif ied-site 
<B) LOCATION: 7 



40 

(ix) 



45 

(ix) 



50 

(ix) 
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(D) OTHER INFORMATION: /note= Xaa is Arg, He, Met, Val, Phe or 

Leu 

(ix) FEATURE: 
5 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /note*= Xaa is Met, He, Phe, Val, Leu or 

Gly 

10 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= Xaa is He, Phe, Met, Gly, Leu or 



15 



20 



55 



Val 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

30 (v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

35 <D) OTHER INFORMATION: /note= Xaa is Phe, Met, Lys, He, His or 

Val 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
40 (B) LOACTION: 2 

(D) OTHER INFORMATION: /notes* Xaa is Arg or Lys 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
45 (B) LOCATION: 3 

(D) OTHER INFORMATION: /note= Xaa is Met, Arg, Gin, Phe, Lys or 

Leu 



50 (ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 
<B) LOCATION: 4 

(D) OTHER INFORMATION: /note= Xaa is Met, Phe, Leu, He, Lys or 

Val 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
<B) LOCATION: 5 

(D) OTHER INFORMATION: /note= Xaa is Ser or Thr 
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(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

5 (D) OTHER INFORMATION: /note= Xaa is Phe, lie, Met, Leu or Val 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 7 

10 (D) OTHER INFORMATION: /note» Xaa is Phe, Arg, Lys, Met or lie 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

15 (D) OTHER INFORMATION: /note* Xaa is Leu, He, Met, Pro or Val 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

20 (D) OTHER INFORMATION: /note- Xaa is Phe, Leu, He, Val, Pro or 

Met 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

25 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

1 5 

(2) INFORMATION FOR SEQ ID NO: 86: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 
40 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= Xaa is Lys, Phe, Asp; His or Met 

(ix) FEATURE: 

(A) NAME/KEY: Modif led- site 

(B) LOACTION: 2 

(D) OTHER INFORMATION: /note= Xaa is Arg, Lys, Phe or Met 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= Xaa is Gin, Met, Lys, Ala or Phe 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 4 

(O) OTHER INFORMATION: /note= Xaa is Gin, Met, Lys, Leu, Phe or 

Ala 



WO 95/18823 



-88- 



PCTADS95/00147 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 5 

5 (D) OTHER INFORMATION: /note= Xaa is Ser or Thr 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

10 (D) OTHER INFORMATION: /note= Xaa is Phe, lie, Met, Leu or Val 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

15 (D) OTHER INFORMATION: /note= Xaa is Asp, Glu, He, Met, Lys, Val 

or Leu 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
20 (B) LOCATION: 8 

(D) OTHER INFORMATION: /note= Xaa is Leu, Met, Lys, He, Val, Gin 

or Gly 

(ix) FEATURE: 
25 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /notes Xaa is Phe, Lys, Met, Val, Leu, He 

or Pro 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 

35 (2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



45 



50 



55 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /notea Xaa is Thr, Pro, Ser, Gin, He, 

Val, Asn or Glu 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOACTION: 2 

(D) OTHER INFORMATION: /note= Xaa is Gly, Pro, Glu, Tyr or Gin 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 
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(D) OTHER INFORMATION: /note* Xaa is Pro, Leu, He or Val 

(ix) FEATURE: 
5 (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

<D) OTHER INFORMATION: /note= Xaa is Leu, Met, Pro, Lys, Gin or 

Thr 

10 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= Xaa is Ser or Thr 

15 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= Xaa is Gly, Pro, Phe, Thr or Leu 

20 (ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /note= Xaa is Pro, Phe, Gly, Tyr, Leu or 

lie 



25 



30 



35 



45 



50 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= Xaa is Phe, Tyr, He or Leu 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO:88: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 : 

Glu Ala He Tyr Ala Ala Pro Phe Ala Lys Lys Lys 
15 io 
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CLAIMS 

1 . A method for determining an amino acid sequence motif for a phosphorylation 
site of a protein kinase, comprising: 

5 

a) contacting the protein kinase with an oriented degenerate peptide library 
comprising non-phosphorylated peptides with a phosphorylatable amino acid 
residue at a fixed non-degenerate position under conditions which allow for 
phosphorylation of a substrate by the protein kinase; 

10 

b) allowing the protein kinase to phosphorylate peptides within the oriented 
degenerate peptide library having a phosphorylation site for the protein kinase 
to form a population of phosphorylated peptides; 

15 c) separating the population of phosphorylated peptides from non- 

phosphorylated peptides; 

d) determining the amino acid sequences of the population of phosphorylated 
peptides; and 

20 

e) determining an amino acid sequence motif for a phosphorylation site of the 
protein kinase based upon the relative abundance of different amino acid 
residues at each degenerate position within the population of phosphorylated 
peptides. 

25 

2. The method of claim 1 , wherein the oriented degenerate peptide library is a 
soluble synthetic peptide library, 

3. The method of claim 1, wherein the oriented degenerate peptide library is a 
30 solid support-bound library or a phage expression library. 

4. The method of claim 1 , wherein the oriented degenerate peptide library 
comprises peptides comprising a formula: 

35 (Xaafo-Zaa-pCaaJm 

wherein Zaa is a non-degenerate phosphorylatable amino acid selected from the group 
consisting of Ser, Thr and Tyr, Xaa is any amino acid and n and m are integers from 1-10 
inclusive. 
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5. The method of claim 1, wherein the phosphorylatable amino acid residue at a 
fixed non-degenerate position of the non-phosphorylated peptides of the oriented degenerate 
peptide library is the only phosphorylatable amino acid residue in the non-phosphorylated 

5 peptides. 

6. The method of claim 1 , wherein the protein kinase is a protein- 
serine/threonine specific kinase and the oriented degenerate peptide library comprises non- 
phosphorylated peptides comprising a formula: 

10 

(XaaVZaa-CXaaJjn 

wherein Zaa is a non-degenerate amino acid selected from Ser or Thr, Xaa is any amino acid 
except Ser or Thr and n and m are integers from 1-10 inclusive. 

15 

7. The method of claim 1 , wherein the protein kinase is a protein-tyrosine 
specific kinase and the oriented degenerate peptide library comprises non-phosphorylated 
peptides comprising a formula: 

20 (Xaa^Zaa-pCaa^ 

wherein Zaa is Tyr, Xaa is any amino acid except Tyr and n and m are integers from 1-10 
inclusive. 

25 8. The method of claim 1 , wherein the protein kinase is a protein kinase selected 

from a group consisting of protein-serine/threonine specific kinases, protein-tyrosine specific 
kinases and dual-specificity kinases and the oriented degenerate peptide library comprises 
non-phosphorylated peptides comprising a formula: 

30 (XaaVZaa-CXaa^ 

wherein Zaa is a non-degenerate amino acid selected from the group consisting of Ser, Thr 
and Tyr, Xaa is any amino acid except an amino acid selected from the group consisting of 
Ser, Thr and Tyr and n and m are integers from 1-10 inclusive. 

35 

9. The method of claim 1 , wherein the oriented degenerate peptide library 

comprises non-phosphorylated peptides comprising a formula: 



Y x -{XaaVZaa-CXaaWYj 
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wherein 

Zaa is a non-degenerate phosphorylatable amino acid selected from the 
group consisting of Ser, Thr and Tyr, 
5 Xaa is any amino acid, 

n and m are integers from 1-10 inclusive, 

Yj is hydrogen or a peptide having a formula (Xaa) a wherein Xaa is any 

non-degenerate amino acid and a is an integer from 1-15 inclusive, and 
Y 2 is hydrogen or a peptide having a formula (Xaa)b wherein Xaa is any 
10 non-degenerate amino acid and b is an integer from 1-15 inclusive. 

10. The method of claim 9, wherein a and b are integers from 1-10 inclusive. 

1 1 . The method of claim 9, wherein a and b are integers from 1-5 inclusive. 

15 

12. The method of claim 1, wherein the oriented degenerate peptide library 
comprises non-phosphorylated peptides comprising a formula: 

Met-Ala-Xaal-Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Ala-Lys-Lys-Ly^ 

20 

wherein Xaal , Xaa2, Xaa3, Xaa4, Xaa6, Xaa7, Xaa8, and Xaa9 are any amino acid and Xaa5 
is a non-degenerate phosphorylatable amino acid selected from the group consisting of Ser, 
Thr and Tyr. 

25 13. The method of claim 1 , wherein the population of phosphory lated peptides is 

separated from non-phosphorylated peptides by binding the population of phosphorylated 
peptides to a ferric column. 

1 4. The method of claim 1 , wherein the protein kinase thio-phosphorylates 

30 peptides within the degenerate peptide library having a phosphorylation site for the protein 
kinase to form a population of thio-phosphory lated peptides and the population of thio- 
phosphorylated peptides are separated from non-phosphorylated peptides. 

1 5. The method of claim 1 4, wherein the population of thio-phosphorylated 
35 peptide is separated from non-phosphorylated peptides by binding the population of thio- 
phosphorylated peptides to a mercury column, wherein peptides within the library do not 
contain Cys. 
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1 6. The method of claim 1 , wherein the protein kinase is cyclic nucleotide- 
dependent protein kinase. 

17. The method of claim 16, wherein the cyclic nucleotide-dependent protein 
5 kinase is protein kinase A 

1 8. The method of claim 1 , wherein the protein kinase is a kinase involved in cell 
cycle control 

10 19. The method of claim 1 8, wherein the protein kinase involved in cell cycle 

control is selected from the group consisting of cyclin B/p33 c< * c 2, cyclin A/ptt™^, cyclin 
E/p33 cdk2 , cyclin Dl/p33 cdk4 , Weel kinase, Niml/Cdrl kinase, Wis 1 kinase and NIMA. 

20. The method of claim 1 , wherein the protein kinase is a member of the src 
1 5 family of protein kinases. 

21 . The method of claim 20, wherein member of the src family of protein kinases 
is selected from the group consisting of ppeO 0 - 510 , pp^** 1 " 0 , Yes, Fgr, FYN, LYN, LCK, 
HOC, Dsrc64 and Dsrc28. 

20 

22. The method of claim 1, wherein the protein kinase is a member of the 
epidermal growth factor receptor family of protein kinases. 

23. The method of claim 22, wherein the member of the epidermal growth factor 
25 receptor family of protein kinases is the epidermal growth factor receptor. 

24. The method of claim 1, wherein the protein kinase is a member of the Abl 
family of protein kinases. 

30 25. The method of claim 24, wherein the member of the Abl family of protein 

kinases is c- abl or pp92 c -*P s/fes . 

26. The method of claim 1, wherein the protein kinase is a member of a family of 

protein kinases selected from the group consisting of calcium phospholipid-dependent 
35 kinases, calcium-calmodulin-dependent kinases, SNF1 kinases, casein kinases, Raf-Mos 
proto-oncogene kinases, STE7 kinases, insulin receptor-like kinases and platelet-derived 
growth factor receptor-like kinases. 
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27. The method of claim 1 , wherein the protein kinase is a protein kinase selected 
from the group consisting of syk, ZAP70, Focal Adhesion Kinase, erkl, erk2, erk3, MEK, 
CSK, BTK, ITK, TEC, TEC-2, JAK-1, JAK-2, LET23, c-fins, S6 kinases, TGF-p/activin 
receptor family kinases and Clk 

5 

28. The method of claim 1 , wherein the amino acid sequence motif for a 
phosphorylation site of the protein kinase is determined by calculating a relative abundance 
(RA) value for each amino acid residue (Xaa) at each degenerate position, wherein RA is 
determined by a formula: 

10 

T? A = amount of Xaa in the population of pho sphorvlated pep tides 
amount of Xaa in the oriented degenerate peptide library 

and selecting amino acid residues that have a relative abundance value of greater than 1.0 at a 
15 degenerate position for inclusion at a position corresponding to the degenerate position in the 
amino acid sequence motif. 

29. A substrate for a protein kinase, comprising a peptide comprising an amino 
acid sequence of the formula: 

20 

Xaal.Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9 

wherein Xaa5 is selected from a group consisting of Ser, Thr and Tyr, and Xaal, Xaa2, Xaa3, 
Xaa4, Xaa6, Xaa7, Xaa8 and Xaa9 are determined by: 

25 

a) contacting the protein kinase with an oriented degenerate peptide library 
comprising non-phosphorylated peptides comprising a formula: 

Met-Ala-Xaal-Xaa2-Xaa3-Xaa4^ 

30 

wherein Xaal, Xaa2, Xaa3, Xaa4, Xaa6, Xaa7, Xaa8, and Xaa9 are any amino 
acid except an amino acid selected from the group consisting of Ser, Thr and 
Tyr and Xaa5 is selected from a group consisting of Ser, Thr and Tyr, under 
conditions which allow for phosphorylation of a substrate by the protein 
35 kinase; 

b) allowing the protein kinase to phosphorylate peptides within the oriented 
degenerate peptide library having a phosphorylation site for the protein kinase 
to form a population of phosphorylated peptides; 
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c) separating the population of phosphorylated peptides from non- 
phosphorylated peptides by binding the population of phosphorylated peptides 
to a ferric column; 

5 

d) determining the amino acid sequences of the population of phosphorylated 
peptides; and 

e) determining a relative abundance (RA) for each amino acid residue at 

1 0 positions in the phosphorylated peptides corresponding to Xaal , Xaa2, Xaa3, 

Xaa4, Xaa6, Xaa7, Xaa8, and Xaa9, wherein the relative abundance is 
determined by a formula: 

R A = amount of Xaa in the population o f phosphoTylated peptides 
15 amount of Xaa in the oriented degenerate peptide library 

and 



f) determining the amino acid sequences of Xaal, Xaa2, Xaa3> Xaa4, Xaa6, 
20 Xaa7, Xaa8, and Xaa9 by selecting amino acid residues that have a relative 

abundance of greater than 1 .0 at positions in the phosphorylated peptides 
corresponding to Xaal, Xaa2, Xaa3, Xaa4, Xaa6, Xaa7, Xaa8, and Xaa9. 

30. The substrate of claim 29, wherein the protein kinase is a protein- 
25 serine/threonine specific kinase or a dual-specificity kinase and Xaa5 is Ser or Thr. 

3 1 . The substrate of claim 30, wherein the protein kinase is a protein-tyrosine 
specific kinase or a dual-specificity kinase and Xaa5 is Tyr. 

30 32. A substrate for protein kinase A, comprising a peptide comprising an amino 

acid sequence: 

Y r Xaal -Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Y 2 

35 wherein one or more amino acids, selected from the group consisting of Xaal, Xaal-Xaa2, 
Xaal-Xaa2-Xaa3, Xaa9 and Xaa8-Xaa9, can be present or absent, 

wherein if Xaal, Xaal-Xaa2 or Xaal-Xaa2-Xaa3 are absent, Yj is hydrogen or an 
amino-deri vative group, 
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or if Xaal-Xaa2-Xaa3 is present, Y\ is hydrogen, an amino-derivative group or a 
peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
if Xaa9 or Xaa8-Xaa9 are absent, Y 2 is hydrogen or a carboxy-derivative 
5 group, 

or if Xaa8-Xaa9 is present Y 2 is hydrogen, a carboxy-derivative group or a 

peptide having a formula (Xaa)b wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
Xaal , if present, is Arg or His, 
10 Xaa2, if present, is Arg or His, 

Xaa3, if present, is Arg, Lys or His, 
Xaa4 is Arg or His, 
Xaa5 is Ser or Thr, 
Xaa6 is He, Met, Phe, Leu or Val, 
15 Xaa7 is lie, Val or Phe, 

Xaa8, if present, is Phe, He, Met, Val or Leu, and 
Xaa9, if present, is He, Leu, Phe, Met or Val. 

33. The peptide of claim 32, wherein Xaa3 is Arg or Lys, Xaa8 ia Phe, He or Met 
20 and Xaa9 is lie. 

34. A pseudosubstrate for protein kinase A, comprising the peptide of claim 32, 
wherein Xaa5 is any amino acid except Ser or Thr and the pseudosubstrate can bind to 
protein kinase A but is not phosphorylated by protein kinase A. 

25 

35. The pseudosubstrate of claim 34, wherein Xaa5 is selected from a group 
consisting of Ala, Asn and Glu. 

36. A protein modified to comprise the substrate for protein kinase A as claimed 
30 in claim 32. 

37. A peptide analog of the peptide of claim 32. 

38. A method for inhibiting phosphorylation of a native substrate by protein 

35 kinase A comprising contacting protein kinase A with an inhibitory amount of the substrate 
for protein kinase A as claimed in claim 32 to competitively inhibit phosphorylation of a 
native substrate by protein kinase A. 
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5 



10 



39 A substrate for a cell cycle control kinase, comprising a peptide comprising ai\ 

amino acid sequence: 

Y 1 -Xaal.Xaa2-Xaa3-Xaa4-Xaa5-Xaa6.Xaa7-Xaa8-Xaa9-Y 2 

wherein one or more amino acids, selected from the group consisting of Xaal, Xaal -Xaa2, 
Xaa9, Xaa-8-Xaa9 and Xaa7-Xaa8-Xaa9, can be present or absent, 

wherein if Xaal or Xaal-Xaa2 are absent, Yi is hydrogen or an amino-derivative 
group, 

or if Xaal-Xaa2 is present, Yj is hydrogen, an amino-derivative group or a 

peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-1 5 inclusive, 
if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, Y 2 is hydrogen or a 
15 carboxy-derivative group, 

or if Xaa7-Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 
peptide having a formula (Xaa)b wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
Xaal, if present, is Arg, His, Asn or Gin, 
20 Xaa2, if present, is Arg, His, Pro, Met, Val or He, 

Xaa3 is Pro, Met or Leu, 
Xaa4 is Met, Gin, Ala^ Asn, Arg or Lys, 
XaaSisSerorThr, 
Xaa6 is Pro, 

25 Xaa7, if present, is Pro, Lys, Met, Ala, Val or Arg, 

Xaa8, if present, is Lys, Arg or Pro, and 
Xaa9, if present, is Lys, Arg, He or Asn. 



40. The substrate of claim 39, wherein the cell cycle control kinase is cyclin 
30 B4>33ttfc2 and Xaal is Arg, Xaa2 is Arg, Xaa3 is Pro or Met, Xaa4 is Met, Xaa7 is Pro or 

Lys, Xaa8 is Lys or Arg and Xaa9 is Lys or lie. 

41. The substrate of claim 39, wherein the cell cycle control kinase is cyclin 
A/p33 CDK 2 and Xaal is Arg or His, Xaa2 is Arg or His, Xaa3 is Pro or Met, Xaa4 is Met or 

35 Gin, Xaa7 is Pro or Met, Xaa8 is Lys or Arg and Xaa9 is Lys. 



42. A pseudosubstrate for a cell cycle control kinase, comprising the peptide of 

claim 39, wherein Xaa5 is any amino acid except Ser or Thr and the pseudosubstrate can bind 
to a cell cycle control kinase but is not phosphorylated by the cell cycle control kinase. 
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43. The pseudosubstrate of claim 42, wherein Xaa5 is selected from the group 

consisting of Ala, Asn and Glu. 

5 44. A protein modified to comprise the substrate for a cell cycle control kinase as 

claimed in claim 39. 

45. A peptide analog of the peptide of claim 39. 



1 0 46. A method for inhibiting phosphorylation of a native substrate by a cell cycle 

control kinase comprising contacting the cell cycle control kinase with an inhibitory amount 
of the substrate for the cell cycle control kinase as claimed in claim 39 to competitively 
inhibit phosphorylation of a native substrate by the cell cycle control kinase. 



1 5 47. A substrate for a src family kinase, comprising a peptide comprising an amino 

acid sequence: 

Y r Xaal -Xaa2-Xaa3-Xaa4-Xaa5.Xaa6-Xaa7-Xaa8-Xaa9-Y 2 

20 wherein one or more amino acids, selected from the group consisting of Xaal , Xaal -Xaa2, 
Xaal-Xaa2-Xaa3, Xaa9 and Xaa-8-Xaa9, can be present or absent, 

wherein if Xaal, Xaal-Xaa2 or Xaal-Xaa2-Xaa3 are absent, Yj is hydrogen or an 
amino-derivative group, 

25 or if Xaal-Xaa2-Xaa3 is present, Y\ is hydrogen, an ammo-derivative group or a 

peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
if Xaa9 or Xaa8-Xaa9 are absent, Y 2 is hydrogen or a carboxy-derivative 
group, 

30 or if Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 

peptide having a formula (Xaa)fc wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
Xaal , if present, is Glu or Asp, 
Xaa2, if present, is Glu or Asp, 
35 Xaa3, if present, is Glu, Asp, Gly or Asn, 

Xaa4 is He, Val, Leu or Pro, 
XaaSisTyr, 

Xaa6 is Gly, Glu or Asp, 
Xaa7 is Glu or Asp, 
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Xaa8, if present, is Phe, He, Leu, Val or Met, and 
Xaa9, if present, is Glu, Phe or Asp. 

48. The substrate of claim 47, wherein the src family kinase is ppeo 0 " 81 " 0 and X*a3 
5 is Glu, Asp or Gly, Xaa4 is He, Val or Leu, Xaa6 is Gly or Glu, Xaa7 is Glu, Xaa8 is Hie or 

lie and Xaa9 is Phe. 

49. The substrate of claim 47, wherein the src family kinase is ppeO^STC and 
Xaa3 is Glu, Asp or Gly, Xaa4 is lie, Val or Leu, Xaa6 is Gly, Glu or Asp, Xaa7 is Gly, XaaS 
is Phe and Xaa9 is Glu or Asp. 

50. A pseudosubstrate for a src family kinase, comprising the peptide of claim 47, 
wherein XaaS is any amino acid except Tyr and the pseudosubstrate can bind to a src family 
kinase but is not phosphorylated by the src family kinase. 

5 1 . The pseudosubstrate of claim 50, wherein XaaS is Phe or His. 

52. A protein modified to comprise the substrate for a src family kinase as claimed 
in claim 47. 

53. A peptide analog of the peptide of claim 47. 

54. A method for inhibiting phosphorylation of a native substrate by a src family 
kinase comprising contacting the src family kinase with an inhibitory amount of the substrate 
for the src family kinase as claimed in claim 47 to competitively inhibit phosphorylation of a 
native substrate by the src family kinase. 

55. A substrate for the EGF receptor, comprising a peptide comprising an amino 
acid sequence: 

Y r Xaal-Xaa2-Xaa3«Xaa4-Xaa5-Xaa6-Xaa7.Xaa8-Xaa9.Y 2 

wherein one or more amino acids, selected from the group consisting of Xaal, Xaa9 and 
Xaa-8-Xaa9, can be present or absent, 

wherein if Xaal is absent, Y\ is hydrogen or an amino-derivative group, 

or if Xaal is present, Yj is hydrogen, an ammo-derivative group or a peptide 
having a formula (Xaa) a wherein Xaa is any amino acid and a is an 
integer from 1-15 inclusive, 
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if Xaa9 or Xaa8-Xaa9 are absent, Y 2 is hydrogen or a carboxy-derivative 
group, 

or if Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 

peptide having a formula (Xaa)b wherein Xaa is any amino acid and b 
5 is an integer from 1-15 inclusive, 

Xaal, if present, is Glu, Asp, Arg or Ala, 
Xaa2 is Glu or Asp, 
Xaa3 is Glu, Asp Ala, Gin or Pro, 
Xaa4 is Glu, Asp, He, Asn or Val, 
10 Xaa5 is Tyr, 

Xaa6 is Phe, Val, He, Glu, Met or Leu, 

Xaa7 is Glu, Phe, Asp, Val or He, 

Xaa8, if present, is Leii, He, Phe, Val or Met, and 

Xaa9, if present, is Val, Phe, Pro, Glu, Leu or lie. 

15 

56. The peptide of claim 55, wherein Xaa3 is Glu or Asp, Xaa4 is Glu, Asp or He, 

Xaa6 is Phe, Val, He or Glu, Xaa7 is Glu, Phe or Asp, Xaa8 is Leu, lie, Phe or Val, and Xaa9 
is Val. 

20 57. A pseudosubstrate for the EGF receptor, comprising the peptide of claim 55, 

wherein Xaa5 is any amino acid except Tyr and the pseudosubstrate can bind to the EGF 
receptor but is not phosphorylated by the EGF receptor. 



25 



30 



35 



58. The pseudosubstrate of claim 55, wherein Xaa5 is Phe or His. 

59. A protein modified to comprise the substrate for the EGF receptor as claimed 
in claim 55. 

60. A peptide analog of the peptide of claim 55. 

61 . A method for inhibiting phosphorylation of a native substrate by the EGF 
receptor comprising contacting the EGF receptor with an inhibitory amount of the substrate 
for the EGF receptor as claimed in claim 55 to competitively inhibit phosphorylation of a 
native substrate by the EGF receptor. 

62. A substrate for pp92 c - f P s/fes , comprising a peptide comprising an amino acid 
sequence: 

Yj -Xaal -Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Y 2 
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wherein one or more amino acids, selected from the group consisting of Xaal, Xaal-Xaa2, 
Xaal-Xaa2-Xaa3, Xaa9, and Xaa8-Xaa9, can be present or absent, 



wherein if Xaal, Xaal-Xaa2 or Xaal-Xaa2-Xaa3 are absent, Yj is hydrogen or an 
5 amino-derivative group, 

or if Xaal-Xaa2-Xaa3 is present, Yj is hydrogen, an amino-derivative group or a 
peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
if Xaa9 or Xaa8-Xaa9 are absent, Y 2 is hydrogen or a carboxy-derivative 
10 group, 

or if Xaa8-Xaa9 is present, Y2 is hydrogen, a carboxy-derivative group or a 

peptide having a formula (Xaa)b wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
Xaal, if present, is Glu, Ala or Asp, 
1 5 Xaa2, if present, is Glu or Ala, 

Xaa3, if present, is Glu, Ala or Gly, 
Xaa4 is He, Glu, Val, Asp or Leu, 
Xaa5 is Tyr, 
Xaa6 is Glu, Asp or Ala, 
20 Xaa7 is Glu, Asp, lie or Val, 

Xaa8, if present, is He, Val, Glu, Met or Leu, and 
Xaa9, if present, is Glu, He, Val, Asp or Gly. 



63. The peptide of claim 62, wherein Xaa3 is Glu or Ala, Xaa4 is He, Glu or Val, 
25 Xaa6 is Glu or Asp, Xaa7 is Glu or Asp, Xaa8 is He, Val or Glu and Xaa9 is Glu, lie, Val or 

Asp. 

64. A pseudosubstrate for pp92 c -*P s/fes , comprising the peptide of claim 62, 
wherein Xaa5 is any amino acid except Tyr and the pseudosubstrate can bind to pp92 c "*P s 'fe s 

30 but is not phosphorylated by pp92 c ~ f P s ' fes . 

65. The pseudosubstrate of claim 64, wherein Xaa5 is Phe or His. 

66. A protein modified to comprise the substrate for pp92 c -ft> s/ fe s as claimed in 
35 claim 62. 



67. 



A peptide analog of the peptide of claim 62. 
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68. A method for inhibiting phosphorylation of a native substrate by pp92 c "^ps/fes 
comprising contacting pp92 c "*P s/ f es with an inhibitory amount of the substrate for 
pp92 c -*P s/fes as claimed in claim 62 to competitively inhibit phosphorylation of a native 
substrate by pp92 c -*P s/fe s 

5 

69. A substrate for Ick, comprising a peptide comprising an amino acid sequence: 
Yj -Xaal -Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9- Y 2 

10 wherein one or more amino acids, selected from the group consisting of Xaal , Xaal -Xaa2, 
Xaal -Xaa2-Xaa3, Xaa9, Xaa8-Xaa9 and Xaa7-Xaa8-Xaa9, can be present or absent, 

wherein if Xaal , Xaal -Xaa2 or Xaal-Xaa2-Xaa3 are absent, Yj is hydrogen or an 
amino-derivative group, 

15 or if Xaal-Xaa2-Xaa3 is present, Yj is hydrogen, an amino-derivative group or a 

peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, Y 2 is hydrogen or a 
carboxy-derivative group, 
20 or if Xaa7-Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 

peptide having a formula (Xaa)^ wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
Xaal, if present, is any amino acid, 
Xaa2, if present, is Glu, Gly, Pro or Ala, 
25 Xaa3, if present, is any amino acid, 

Xaa4 is He, Val, Leu, Phe or Met, 
Xaa5 is Tyr, 

Xaa6 is Gly, Ala, Met, Glu or Val, 
Xaa7, if present, is Val, Ala, Pro, Met, He, Phe, Lys or Leu, 
30 Xaa8, if present, is Leu, Val, Phe, He or Met, 

Xaa9, if present, is Phe, Leu, Val, He, Pro, Met, Lys or Gly. 

70. The peptide of claim 69, wherein Xaa2 is Glu, Xaa4 is He, Val, Leu or Phe, 

Xaa6 is Gly or Ala, Xaa7 is Val, Xaa8 is Leu, Val, Phe or He, and Xaa9 is Phe, Leu, Val or 
35 He. 

71 - A pseudosubstrate for Ick, comprising the peptide of claim 69, wherein Xaa5 

is any amino acid except Tyr and the pseudosubstrate can bind to Ick but is not 
phosphorylated by Ick. 
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72. The pseudosubstrate of claim 71, wherein Xaa5 is Phe or His. 

73. A protein modified to comprise the substrate for lck as claimed in claim 69. 

5 

74. A peptide analog of the peptide of claim 69. 

75. A method for inhibiting phosphorylation of a native substrate by lck 
comprising contacting lck with an inhibitory amount of the substrate for lck as claimed in 

10 claim 69 to competitively inhibit phosphorylation of a native substrate by lck. 

76. A substrate for c-abl, comprising a peptide comprising an amino acid 
sequence: 

1 5 Yj -Xaal -Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Y 2 

wherein one or more amino acids, selected from the group consisting of Xaal, Xaal-Xaa2, 
Xaal-Xaa2-Xaa3, Xaa9, Xaa8-Xaa9 and Xaa7-Xaa8-Xaa9, can be present or absent, 

20 wherein if Xaal , Xaal -Xaa2 or Xaal -Xaa2-Xaa3 are absent, Yj is hydrogen or an 

amino-derivative group, 
or if Xaal-Xaa2-Xaa3 is present, Yj is hydrogen, an amino-derivative group or a 
peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
25 if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, Y 2 is hydrogen or a 

carboxy-derivative group, 
or if Xaa7-Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 
peptide having a formula (Xaa)fc wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
30 Xaal , if present, is Ala, Arg, Gly or Pro, 

Xaa2, if present, is any amino acid, 
Xaa3 , if present, is Val, Ala, Pro, He, Gly or Glu, 
Xaa4 is lie, Val, Leu or Met, 
Xaa5 is Tyr, 

35 Xaa6 is Ala, Met, Gin, Gly or Phe, 

Xaa7, if present, is Ala, Val, Phe, Met, Gin, Leu or lie, 
Xaa8, if present, is Pro, Phe, Leu, He, Val or Ala, and 
Xaa9, if present, is Phe, Leu, lie, Met, Gly or Pro. 
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77. The peptide of claim 76, wherein Xaal is Ala, Xaa3 is Val, Xaa4 is He, Val or 
Leu, Xaa6 is Ala or Met, Xaa7 is Ala, Xaa8 is Pro or Phe and Xaa9 is Phe. 

78. A pseudosubstrate for c-abl, comprising the peptide of claim 76, wherein Xaa5 
5 is any amino acid except Tyr and the pseudosubstrate can bind to c-abl but is not 

phosphorylated by c-abl. 

79. The pseudosubstrate of claim 78, wherein Xaa5 is Phe or His. 

10 80. A protein modified to comprise the substrate for c-abl as claimed in claim 76. 

81. A peptide analog of the peptide of claim 76. 

82. A method for inhibiting phosphorylation of a native substrate by c-abl 

15 comprising contacting c-abl with an inhibitory amount of the substrate for c-abl as claimed in 
claim 76 to competitively inhibit phosphorylation of a native substrate by c-abl. 

83. A substrate for platelet-derived growth factor receptor (PDGFR), comprising a 
peptide comprising an amino acid sequence: 



20 



Y r Xaal-Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Y 2 

wherein one or more amino acids, selected from the group consisting of Xaal, Xaal-Xaa2, 
Xaal-Xaa2-Xaa3, Xaa9, Xaa8-Xaa9 and Xaa7-Xaa8-Xaa9, can be present or absent, 



25 



wherein if Xaal, Xaal-Xaa2 or Xaal-Xaa2-Xaa3 are absent, Yj is hydrogen or an 
amino-derivative group, 
or if Xaal-Xaa2-Xaa3 is present, Yj is hydrogen, an amino-derivative group or a 
peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
30 is an integer from 1-15 inclusive, 

if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, Y 2 is hydrogen or a 
carboxy-derivative group, 
or if Xaa7-Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 
peptide having a formula (Xaa)fc wherein Xaa is any amino acid and b 
35 is an integer from 1-15 inclusive, 

Xaal , if present, is Glu or Asp, 
Xaa2, if present, is Glu or Asp, 
Xaa3, if present, is Glu or Asp, 
Xaa4 is Glu or Asp, 
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10 



15 



Xaa5 is Tyr, 

Xaa6is Val, Glu or He, 

Xaa7, if present, is Phe, 

Xaa8, if present, is He, Val or Phe, and 

Xaa9, if present, is any amino acid. 

84. A pseudosubstrate for PDGFR, comprising the peptide of claim 83, wherein 
Xaa5 is any amino acid except Tyr and the pseudosubstrate can bind to PDGFR but is not 
phosphorylated by PDGFR. 

85. The pseudosubstrate of claim 84, wherein Xaa5 is Phe or His. 

86. A protein modified to comprise the substrate for PDGFR as claimed in 
claim 83. 

87. A pepide analog of the peptide of claim 83. 



88. A method for inhibiting phosphoiylation of a native substrate by PDGFR 
comprising contacting PDGFR with an inhibitory amount of the substrate for PDGFR as 

20 claimed in claim 83 to competitively inhibit phosphorylation of a native substrate by PDGFR. 

89. A substrate for fibroblast growth factor receptor (FGFR), comprising a peptide 
comprising an amino acid sequence: 

25 Y r Xaal-Xaa2-Xaa3-Xaa4.Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Y 2 

wherein one or more amino acids, selected from the group consisting of Xaal, Xaal-Xaa2, 
Xaal-Xaa2-Xaa3, Xaa9, Xaa8-Xaa9 and Xaa7-Xaa8-Xaa9, can be present or absent, 

30 wherein if Xaal, Xaal-Xaa2 or Xaal-Xaa2-Xaa3 are absent, Y, is hydrogen or an 

amino-derivative group, 
or if Xaal -Xaa2-Xaa3 is present, Y] is hydrogen, an amino-derivative group or a 
peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
35 if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, Y 2 is hydrogen or a 

carboxy-derivative group, 
or if Xaa7-Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-deri vati ve group or a 
peptide having a formula (Xaa)fc wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
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Xaal, if present, is Ala, Lys, Glu, Phe, Pro or Gly, 
Xaa2, if present, is Glu, Ala, Phe, Lys, Met, Gly, He or Val, 
Xaa3, if present, is Glu, Lys, Ala, Phe, Pro, Gly or Val, 
Xaa4 is Glu, lie, Val, Asp, Phe, Ala or Leu, 
5 Xaa5 is Tyr, 

Xaa6 is Phe, Val, Glu, He, Met, Ala or Lys, 
Xaa7, if present, is Phe, Val, Met, He, Leu, Glu, Pro or Lys, 
Xaa8, if present, is Leu, Phe, Met, He, Val or Glu, 
Xaa9, if present, is Phe, Leu, Met, Val, Glu, He or Pro. 

10 

90. The peptide of claim 89, wherein Xaal is Ala, Xaa2 is Glu or Ala, Xaa3 is 

Glu, Xaa4 is Glu, Xaa6 is Phe or Val, Xaa7 is Phe, Xaa8 is Leu, Phe, Met, lie or Val and 
Xaa9 is Phe. 

15 91. A pseudosubstrate for FGFR, comprising the peptide of claim 89, wherein 

Xaa5 is any amino acid except Tyr and the pseudosubstrate can bind to FGFR but is not 
phosphorylated by FGFR. 

92. The pseudosubstrate of claim 91 , wherein Xaa5 is Phe or His. 

20 

93. A protein modified to comprise the substrate for FGFR as claimed in claim 89. 

94. A peptide analog of the peptide of claim 89. 

25 95. A method for inhibiting phosphorylation of a native substrate by FGFR 

comprising contacting FGFR with an inhibitory amount of the substrate for FGFR as claimed 
in claim 89 to competitively inhibit phosphorylation of a native substrate by FGFR. 

96. A substrate for insulin receptor, comprising a peptide comprising an amino 

30 acid sequence: 

Y r Xaal-Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Y 2 

wherein one or more amino acids, selected from the group consisting of Xaal, Xaal-Xaa2, 
35 Xaal -Xaa2-Xaa3, Xaa9, Xaa8-Xaa9 and Xaa7-Xaa8-Xaa9, can be present or absent, 



wherein if Xaal , Xaal -Xaa2 or Xaal -Xaa2-Xaa3 are absent, Yj is hydrogen or an 
amino-derivative group, 
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or if Xaal-Xaa2-Xaa3 is present, Yj is hydrogen, an amino-derivative group or a 
peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, Y 2 is hydrogen or a 
5 carboxy-derivative group, 

or if Xaa7-Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 
peptide having a formula (Xaa)fc wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
Xaal , if present, is any amino acid, 
10 Xaa2, if present, is Glu, Gly, Ala, Phe, Asn or Met, 

Xaa3, if present, is Glu, Ala, Gly, Pro or Asn, 
Xaa4 is Glu, Asn, Asp, Ala, He, Pro or Gly, 
Xaa5 is Tyr, 

Xaa6 is Met, Phe, Ala, Glu or Val, 
1 5 Xaa7, if present, is Met, Phe, Val, Glu, Ala, Asp, Leu or He, 

Xaa8, if present, is Met, Phe, He, Glu or Val, 
Xaa9, if present, is Met, Phe, Glu, Val, Leu, He or Gly. 

97. The peptide of claim 96, wherein Xaa2 is Glu, Xaa3 is Glu, Xaa4 is Glu, Asn 

20 or Asp, Xaa6 is Met or Phe, Xaa7 is Met or Phe, Xaa8 is Met, Phe, He or Glu and Xaa9 is 
Met, Phe or Glu. 



98. A pseudosubstrate for insulin receptor, comprising the peptide of claim 96, 
wherein Xaa5 is any amino acid except Tyr and the pseudosubstrate can bind to insulin 

25 receptor but is not phosphorylated by insulin receptor. 

99. The pseudosubstrate of claim 98, wherein XaaS is Phe or His. 

100. A protein modified to comprise the substrate for insulin receptor as claimed in 
30 claim 96. 



101. A peptide analog of the peptide of claim 96. 

102. A method for inhibiting phosphorylation of a native substrate by insulin 

35 receptor comprising contacting insulin receptor with an inhibitory amount of the substrate for 
insulin receptor as claimed in claim 96 to competitively inhibit phosphorylation of a native 
substrate by insulin receptor. 
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103. A substrate for casein kinase II, comprising a peptide comprising an amino 

acid sequence: 



Y r Xaal-Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Y 



2 



wherein one or more amino acids, selected from the group consisting of Xaal, Xaal-Xaa2, 
Xaal-Xaa2-Xaa3, Xaa9, Xaa8-Xaa9 and Xaa7-Xaa8-Xaa9, can be present or absent, 

wherein if Xaal , Xaal -Xaa2 or Xaal -Xaa2-Xaa3 are absent, Yj is hydrogen or an 
10 amino-derivative group, 

or if Xaal-Xaa2-Xaa3 is present, Yj is hydrogen, an amino-derivative group or a 
peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, Y 2 is hydrogen or a 
15 carboxy-derivative group, 

or if Xaa7-Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 
peptide having a formula (Xaa)b wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
Xaal , if present, is Glu, Asp, Ala or Gly, 
20 Xaa2, if present, is Asp, Glu, Gly or Ala, 

Xaa3, if present, is Glu, Asp, Ala or Gly, 
Xaa4 is Glu, Asp, Gly, Ala or Met, 
Xaa5 is Ser or Thr, 
Xaa6 is Glu, Asp or Ala, 
25 Xaa7, if present, is Asp, Glu or Ala, 

Xaa8, if present, is Glu, Asp or Ala, 
Xaa9, if present, is Glu, Asp or He. 

104. The peptide of claim 103, wherein Xaal is Glu, Asp or Ala, Xaa2 is Asp or 
30 Glu, Xaa3 is Glu or Asp, Xaa4 is Glu or Asp, Xaa6 is Glu or Asp, Xaa7 is Asp or Glu, Xaa8 

is Glu or Asp and Xaa9 is Glu or Asp. 

105. A pseudosubstrate for casein kinase II, comprising the peptide of claim 103, 
wherein Xaa5 is any amino acid except Ser or Thr and the pseudosubstrate can bind to casein 

35 kinase II but is not phosphorylated by casein kinase II. 



106. 



The pseudosubstrate of claim 105, wherein Xaa5 is Ala, Asn or Glu. 
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107. A protein modified to comprise the substrate for casein kinase II as claimed in 
claim 103. 

1 08. A peptide analog of the peptide of claim 1 03. 

5 

109. A method for inhibiting phosphorylation of a native substrate by casein kinase 
II comprising contacting casein kinase II with an inhibitory amount of the substrate for casein 
kinase II as claimed in claim 103 to competitively inhibit phosphorylation of a native 
substrate by casein kinase II. 

10 

110. A substrate for NIMA kinase, comprising a peptide comprising an amino acid 
sequence: 

Yj -Xaal -Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Y 2 

15 

wherein one or more amino acids, selected from the group consisting of Xaal, Xaal-Xaa2, 
Xaal-Xaa2-Xaa3, Xaa9, Xaa8-Xaa9 and Xaa7-Xaa8-Xaa9, can be present or absent, 

wherein if Xaal, Xaal-Xaa2 or Xaal-Xaa2-Xaa3 are absent, Yj is hydrogen or an 
20 amino-derivative group, 

or if Xaal-Xaa2-Xaa3 is present, Yj is hydrogen, an amino-derivative group or a 
peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, Y 2 is hydrogen or a 
25 carboxy-derivative group, 

or if Xaa7-Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 
peptide having a formula (Xaa)^ wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
Xaal, if present, is Arg, Asn, Phe, Met, Gin or Gly, 
30 Xaa2, if present, is Phe, Leu or Met, 

Xaa3, if present, is Arg, Lys or Asn, 
Xaa4 is Arg, Lys, Met or Gin, 
XaaSis SerorThr, 

Xaa6 is Arg, He, Val, Met, Leu or Gin, 
35 Xaa7, if present, is Arg, He, Met, Val, Phew Leu, 

Xaa8, if present, is Met, He, Phe, Val, Leu or Gly, 
Xaa9, if present, is He, Phe, Met, Gly, Leu or Val. 
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111. The peptide of claim 1 10, wherein Xaal is Arg or Asn, Xaa3 is Arg, Xaa4 is 

Arg or Lys, Xaa 6 is Arg, He, Val or Met, Xaa7 is Arg, He, Met, Val or Phe, Xaa8 is Met, He, 
Phe or Val, and Xaa9 is He, Phe or Met. 

5 112. A pseudosubstrate for NIMA kinase, comprising the peptide of claim 1 1 0, 

wherein Xaa5 is any amino acid except Ser or Thr and the pseudosubstrate can bind to NIMA 
kinase but is not phosphorylated by NIMA kinase. 

113. The pseudosubstrate of claim 1 1 2, wherein Xaa5 is Ala, Asn or Glu. 

10 

114. A protein modified to comprise the substrate for NIMA kinase as claimed in 
claim 110. 



115. A peptide analog of the peptide of claim 110. 

15 

116. A method for inhibiting phosphorylation of a native substrate by NIMA kinase 
comprising contacting NIMA kinase with an inhibitory amount of the substrate for NIMA 
kinase as claimed in claim 1 10 to competitively inhibit phosphorylation of a native substrate 
by NIMA kinase. 

20 

117. A substrate for phosphorylase kinase, comprising a peptide comprising an 
amino acid sequence: 



25 



Y r Xaal-Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Y 2 

wherein one or more amino acids, selected from the group consisting of Xaal, Xaal-Xaa2, 
Xaal-Xaa2-Xaa3, Xaa9, Xaa8-Xaa9 and Xaa7-Xaa8-Xaa9, can be present or absent, 



wherein if Xaal , Xaal -Xaa2 or Xaal -Xaa2-Xaa3 are absent, Yj is hydrogen or an 
30 amino-derivative group, 

or if Xaal-Xaa2-Xaa3 is present, Yj is hydrogen, an amino-derivative group or a 
peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, Y 2 is hydrogen or a 
35 carboxy-derivative group, 

or if Xaa7-Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 
peptide having a formula (Xaa)b wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
Xaal, if present, is Phe, Met, Lys, He, His or Val, 
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Xaa2, if present, is Arg or Lys, 
Xaa3, if present, is Met, Arg, Gin, Phe, Lys or Leu, 
Xaa4 is Met, Phe, Leu, He, Lys or Val, 
Xaa5 is SerorThr, 
5 Xaa6 is Phe, He, Met, Leu or Val, 

Xaa7, if present, is Phe, Arg, Lys, Met or He, 
Xaa8, if present, is Leu, He, Met, Pro or Val, 
Xaa9, if present, is Phe, Leu, He, Val, Pro or Met. 



10 118. The peptide of claim 1 1 7, wherein Xaal is Phe or Met, Xaa2 is Arg or Lys, 

Xaa3 is Met, Arg, Gin or Phe, Xaa4 is Met, Phe, Leu or lie, Xaa6 is Phe, He, Met or Leu, 
Xaa7 is Phe, Arg or Lys, Xaa8 is Leu or lie and Xaa9 is Phe, Leu or He. 

1 1 9 * A pseudosubstrate for phosphorylase kinase, comprising the peptide of claim 

15 117, wherein Xaa5 is any amino acid except Ser or Thr and the pseudosubstrate can bind to 
phosphorylase kinase but is not phosphoiylated by phosphorylase kinase. 

120. The pseudosubstrate of claim 1 19, wherein Xaa5 is Ala, Asn or Glu. 

20 121. A protein modified to comprise the substrate for phosphorylase kinase as 

claimed in claim 117. 



122. A peptide analog of the peptide of claim 1 1 7. 

25 123. A method for inhibiting phosphorylation of a native substrate by 

phosphorylase kinase comprising contacting phosphorylase kinase with an inhibitory amount 
of the substrate for phosphorylase kinase as claimed in claim 1 17 to competitively inhibit 
phosphorylation of a native substrate by phosphorylase kinase. 

30 1 24. A substrate for Cam kinase II, comprising a peptide comprising an amino acid 

sequence: 



Y,-Xaal-Xaa2-Xaa3-Xaa4.Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Y 2 

35 wherein one or more amino acids, selected from the group consisting of Xaal, Xaal-Xaa2, 
Xaal-Xaa2-Xaa3, Xaa9, Xaa8-Xaa9 and Xaa7-Xaa8-Xaa9, can be present or absent, 

wherein if Xaal, Xaal -Xaa2 or Xaal-Xaa2-Xaa3 are absent, Y, is hydrogen or an 
amino-derivative group, 
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or if Xaal-Xaa2-Xaa3 is present, Yj is hydrogen, an ammo-derivative group or a 
peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, Y 2 is hydrogen or a 
5 carboxy-derivative group, 

or if Xaa7-Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 
peptide having a formula (Xaa)b wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
Xaal , if present, is Lys, Phe, Asp, His or Met, 
1 0 Xaa2, if present, is Arg, Lys, Phe or Met, 

Xaa3, if present, is Gin, Met, Lys, Ala or Phe, 
Xaa4 is Gin, Met, Lys, Leu, Phe or Ala, 
Xaa5 is Ser or Thr, 
Xaa6 is Phe, He, Met, Leu or Val, 
15 Xaa7, if present, is Asp, Glu, He, Met, Lys, Val, or Leu, 

Xaa8, if present, is Leu, Met, Lys, He, Val, Gin or Gly, 
Xaa9, if present, is Phe, Lys, Met, Val, Leu, He or Pro. 



125. The peptide of claim 124, wherein Xaal is Lys or Phe, Xaa2 is Arg or Lys, 
20 Xaa3 is Gin or Met, Xaa4 is Gin, Met, Lys, Leu or Phe, Xaa6 is Phe, He, Met or Leu, Xaa7 is 

Asp, Glu or He, Xaa8 is Leu, Met, Lys or He and Xaa9 is Phe. 

126. A pseudosubstrate for Cam kinase II, comprising the peptide of claim 124, 
wherein Xaa5 is any amino acid except Ser or Hit and the pseudosubstrate can bind to Cam 

25 kinase H but is not phosphorylated by Cam kinase II. 

1 27. The pseudosubstrate of claim 126, wherein Xaa5 is Ala, Asn or Glu. 

128. A protein modified to comprise the substrate for Cam kinase n as claimed in 
30 claim 124. 

129. A peptide analog of the peptide of claim 124. 

1 30. A method for inhibiting phosphorylation of a native substrate by Cam kinase 
35 II comprising contacting Cam kinase II with an inhibitory amount of the substrate for Cam 

kinase II as claimed in claim 124 to competitively inhibit phosphorylation of a native 
substrate by Cam kinase II. 
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131. A substrate for Erk 1 , comprising a peptide comprising an amino acid 
sequence: 

Yj -Xaal -Xaa2-Xaa3-Xaa4-Xaa5-Xaa6-Xaa7-Xaa8-Xaa9-Y 2 

wherein one or more amino acids, selected from the group consisting of Xaal, Xaal-Xaa2, 
Xaal-Xaa2-Xaa3, Xaa9, Xaa8-Xaa9 and Xaa7-Xaa8-Xaa9, can be present or absent, 

wherein if Xaal, Xaal -Xaa2 or Xaal-Xaa2-Xaa3 are absent, Yj is hydrogen or an 
10 amino-derivative group, 

or if Xaal-Xaa2-Xaa3 is present, Y\ is hydrogen, an amino-derivative group or a 
peptide having a formula (Xaa) a wherein Xaa is any amino acid and a 
is an integer from 1-15 inclusive, 
if Xaa9, Xaa8-Xaa9 or Xaa7-Xaa8-Xaa9 are absent, Y 2 is hydrogen or a 
carboxy-derivative group, 
or if Xaa7-Xaa8-Xaa9 is present, Y 2 is hydrogen, a carboxy-derivative group or a 
peptide having a formula (Xaa)b wherein Xaa is any amino acid and b 
is an integer from 1-15 inclusive, 
Xaal, if present, is Thr, Pro, Ser, Gin, He, Val, Asn or Glu, 
Xaa2, if present, is Gly, Pro, Glu, Tyr or Gin, 
Xaa3, if present, is Pro, Leu, He or Val, 
Xaa4 is Leu, Met, Pro, Lys, Gin or Thr, 
Xaa5 is Ser or Thr, 
Xaa6 is Pro, 

Xaa7, if present, is Gly, Pro, Phe, Thr or Leu, 
Xaa8, if present, is Pro, Phe, Gly, Tyr, Leu or He, 
Xaa9, if present, is Phe, Tyr, He or Leu. 

1 32. The peptide of claim 131, wherein Xaal is Thr or Pro, Xaa2 is Gly, Pro, Glu 
or Tyr, Xaa3 is Pro or Leu, Xaa4 is Leu, Met or Pro, Xaa7 is Gly or Pro, Xaa8 is Pro, Phe, 
Gly or Tyr and Xaa9 is Phe or Tyr. 

133. A pseudosubstrate for Erk 1 , comprising the peptide of claim 131, wherein 
Xaa5 is any amino acid except Ser or Thr and the pseudosubstrate can bind to Erkl but is not 
phosphorylated by Erkl. 

134. The pseudosubstrate ofclaim 133, wherein XaaS is Ala, Asn or Glu. 



135. 



A protein modified to comprise the substrate for Erkl as claimed in claim 131. 
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1 36. A peptide analog of the peptide of claim 131. 

137. A method for inhibiting phosphorylation of a native substrate by Erkl 

5 comprising contacting Erkl with an inhibitory amount of the substrate for Erkl as claimed 
claim 131 to competitively inhibit phosphorylation of a native substrate by Erkl . 
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